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Claim Nos.: 31 
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such an extent that no meaninglul intematxona] search can be carried out. specifically: 
Ihis claim could not be searched because i^licant did not provide a CRF. 
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Claim Nos.: 

because they are dependent claims and are not drafted in accordance with the second and diird sentences of Rule 



Box n Observations where unity of invention is lacking (Continuation of Item 2 of first sheet) 



This Intmational Searching Authority fotmd mult^le inventioos in this international E^lication, as follows: 
Please See CooUnuation 9ieet 



1 . I I As all required additional search fees vMsre timdy paid by the applicant, this intemational search report covers all 

searchable claims. 
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payment of any additional fee. 
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The special technical feature of group 4. 16 and 31 is considered in be a method of producing recombinant adenoviral particles. Each 
group contains different sequences hence the rcsulung particles would have different structures and funciioas associated with the 
particle. 

The special technical feature of group 5, 17 and 32 is considered to be a method of producing a ceUular mediated innnune response to 
the heterologous protein encoded by the differenl adenoviral vectors. Each group contains different sequences a encoding different 
protein, therefore the resulting immune response will also be differenl. 

The special technical feature of group 6, 18 and 33 is conadered to be a method of producing a ceUular mediated immune response to 
tte heterologous protein encoded by the differem adenovixaJ vectors in conjunction with immunizing the individual a DNA plasmid 
vaccine. Each method contains different sequences encoding a differenl protein, therefore the resulting immune response wUl also be 
different. 

Accordingly, groups 1-48 are not so linked by the same or corresponding technical feaaire as to form a single general inventive 
concept. 
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and AE3 , the vector coniains ihc cis-aciing packaging sequence of ihe wild ij^e 
»denovinis genome, and a tene vAiich encodes an HIV Pol protein CSEO ID NO: 1 ) 
inserted in El . 
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55 


The claim is directed to an adenoviral vector thai is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging sequence of the wild type 
adenovirus genome, and a zene \vtuch encodes an HIV Pol protein (SEO ID NO: 5) 
jDsened in El . 
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55 


Tltf claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vecEor coatains the cis-actiqg packagii^ sequence of the wild type 
adenoinms eenoxne. and a eene v^ch encodes an HIV Pol protein (SEQ ID NO: 7j 

inserted in El. 
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57-61 


The claims are (firected to a method of making and harvesting of a recombinant 
adenoviral particle thai contains a gene encoding an HIV Pol protein. 


17 


62. 65. 66 


The claim is directed to a mediod of generating a cellular mediated inmune response 
to HIV Pol protein with the recombinant adenoviral particle. 
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63.64 


The claim is directed to a method of generating a cellular mediated immune response 
to HIV Pol protein with the recombinant adenoviral particle in addition to 
administering a DNA plasmid vaccine. 
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67-70. 72. 
73.75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AE 1 , the vector contains the cis-aciing packaging sequence of the wild type 
adenovirus Benome. and a gene 'Mhich encodes an HIV Nef protein (SEQ ID NO: 9) 
inserted in the parallel orientation of E I . 
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67-70, 72, 
73.75 


The claims are directed to an adenoviral vector that is at least partially deleted of 
AEl, the vector contains the cis-aciing packaging sequence of the wild type 
adenovirus eenome. and a gene vibkh encodes an HIV Nef protein (SEQ ID NO: 11) 
inserted in the parallel orientation of £ 1 . 


21 


67-70. 72. 
73,75 


The daimis are directed to an adenoviral vector that is at least partially deleted of 
AEl. the vector contains the cis-acting packaging sequence of the wild type 
adenovinis eenome. and a eene vihich encodes an HIV Nef protein (SEQ ID NO: 13) 
inserted in the parallel orientation of £1 . 


22 


67-70, 72. 
73. 75 


The claims are directed to an adenoviral vector thai is at least parUally deleted of 
AEl. the vector contains ttie ds-acting packaging sequence of (he wild type 
adenovirus eenome. and a gene ^ch encodes an HIV Nef protein (SEQ ID NO: 15) 
inserted in the parallel orientation of £1. 
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Tie claim is directed to an adenoviral vector that is at least partially deleted of AEl , 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
eenonK. and a eene which encodes an HIV Nef protein (SEQ ID NO: 9) inserted in 
the antiparallel orientation of El . 
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The claim is directed to an adenoviral vector that is at least partially deleted of AEl , 
the vector coniains the cis-acting packaging sequence of the vAld type adenovirus 
genome, and a cere which encodes an HIV Nef protein (SEQ ID NO: 1 1) inserted in 
the antiparallel orientation of El. 
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The claim is directed to an adenoviral vector that is at least partially deleted of AEl. 
the vector coniains the cis-acting packaging sequence of the wild type adenovirus 
eenome. and a eene which encodes an HIV Nef protein (SEQ ID NO: 13) inserted in 
the antiparallel orientation of E 1 . 
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The clami is directed to an adenoviral vector that is at least partially deleted oi AEl . 
the vector contains the cis-acting packaging sequence of the wild type adenovirus 
oAnram^ nnH n opne uihif^h Miendfrs ftn HIV Nef nnitein fSEO ID NO* l5) inserted In 

the antiparallel orieniHiion of El. 
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The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains the cis-acting packaging i^equence uf the wild type 
adenovirus eenome. and a eene \\^ch encodes an HIV Nef protein (SEQ ID NO: 9) 
inserted in El. 


28 


74 


The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 , the vector contains the cis> acting packaging sequence of the wild type 
adenovirus eenome. and a eene which encodes an HIV Nef protein (SEQ ID NO: i 1) 
inserted in El. 
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The claim is directed to an adenoviral vector that is at least partially deleted of AEl 
and AE3 . the vector coniains the cis-acting packaging sequence of the v»dJd type 
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The claims are (firected to a method of makhig and harvesting of a recombinant 
adenoviral particle thai contains a gene encoding an HIV Pol protein. 
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Tl£ claim is directed to a method of generating a cellular mediated ironune response 
to HIV Pol protein with the recombinant adenoviral panicle. 
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The claim is directed to a method of generaimg a cellular mediated mnmine response 
rn, fnv Pnl prntein with the recombinant adenoviral particle in addition to 
aHmmisterin^ a DNA plasmid vaccine. 
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adenovirus geoome. and a gene which encodes an HIV Nef proL ein (SEP ID NO: 13) 
inserted in El. 



The claim is directed to an adenoviral vector that Is ai least partially deleted of AEl^ 
andAE3 . the vector contains the cis-acting packaging sequence of the wUd type 



adenovirus genome . and a gene vwhich encodes an HIV Nef prote in (SEP ID NO: 15) 
inserted in EI 



The claims are cfirected to a method of makiiig and harvesting of a recnmhinant 
adenoviral particle that comams a gene encoduig an HIV Nef protein. 



The claims are directed to a method of generating a cellular mediated imiiuine 

response to HIV Nef with the recombinant adenoviral particle. 

The claims are ddrecied to a method of generating a cellular mediated immune 
response to HIV Nef with the recombinant adenoviral particle inadditioa to 
administering a DNA plasmid vaccine. 



The claim is drawn to a multivalent vaccine wherein gag. pol and ne/are expressed 

from three individual vectors. 

The claims are drawn to a multivalent vaccine wherein gag. pol and nef arc expressed 
from one individual vectors. 



The claims arc drawn to a muliivaleiu vaccine wherein gag, pol and n^/are expressed 
fro m two individual vectors, one expressing nef pol fusion and one expressing gag. 
The claims are drawn to a multivalent vaccine viierein gag, pol and /i^/are e^qiressed 
from two individual vectors, one expressing gag-pol fiision and one expressing nef. 



The claims are drawn to a multivalent vaccine x^^erein gag, pol and /i^/ are expressed 
from two individual vectors . one expressing nef-gag fusion and one expressing pol. 
The claims are drawn to a multivalent vaccine vi)erein gag, pol and nc/are expressed 

from a single vectors as a fusion protein^ 

The claims are drawn to a multivalent vaccine vAierein gag and pol arc expressed 
from two individual vectors. 



The claims are drawn to a multivalent vaccine v^erein gag and pol arc expressed 
individaally from one vector. 



The churns are drawn to a multivalent vaccine ^erem pol'anA nefzst expressed 
from two individual vectors. 



The claims are drawn to a nultivalent vaccine wherein pol and ne/are expressed 

fr om individually from one vector. 

The i-ift^mie are drawn to a multivalent vaccine wteein nef and gag are expressed 
individually from one vector 



The cUiims are drawn to a multivalent vaccine u4ierein nef and gag are expressed 
individually from one vector. 

The daiihs are drawn to a multivalent vaccme wherein gag and pol are expressed as a 
fiision protein from one vector. 

The claims are drawn to a multivalent vaccme wherein pol and iie/are expressed as a 
fusion protein from one vector 



rhe claims arc drawn to a multivalent vaccine wherein /le/and gag are expressed as a 
fiision proteui from one vector. 



The invenUons listed as Groups 1-48 do not relate to a single general inventive concept under PCT Rule 13. 1 because, under PCT 
Rule 13.2. they lack the same or corresponding special technical feamres for the foUowing reasons: 

The technical feature linking groups 1-33 appears to be a recombinant adenoviral vector wherein the adenoviral vector is at least 
Pariiallv deleted in El but the vector may contain more deletions, the vector contains wUd type sequences including packaging .^signals 

a gene encoding a heterologous HIV protein or fragments thereof. Ertl et al. (WO 96A39178) disclose a recombinant adenoviral 
^•t^ctor that U deleted in El and partially deleted in E3, the reinainder of the adenoviral vector conUiins wUd type sequences. The 
^ectiir addiUonally contains an insertion of a heterologous protein v*uch inchidbs HIV protons (see abstract and claims 1 and 5). 
■^^refore, the technical feature lixidng the inventions of groups 1-45 does not constitute a special technical feature as defrned by PCT 
*3.2, as rt does not define a contribution over the prior art. 

^^^'^ i^pecial technical feature of the following groups 1-3. 7- 1 5. 19-30 irnd 34-48 is considered tn be the combination of sequences that 

^*i>iclosed in each grtnip. see individual claim groupmgs above lor Qic dilTereni sequences. The DNA disclosed in each group is 
"^-ulc lip different .»;eq'ueni:c having a different .stniciure and dilTercni amctitm. 
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(57) Abstract: First generation adenoviral vectors and associated recombinant adenovirus-based HTV vaccines which show en- 
hanced stability and growth properties and greater cellular-mediated immunity are described within this specification. These ade- 
noviral vectors are utilized to generate and produce through cell culture various adenoviral-based HTV-l vaccines which contain 
HTV-l gag, HTV-l pol and/or HTV-l nef polynucleotide phannaceutical products, and biologically relevant modifications thereof. 
These adenovirus vaccines, when directly introduced into living vertebrate tissue, preferably a manimalian host such as a human or a 
non-human mammal of commercial or domestic veterinary importance, express the HTV I - Gag, Pol and/or Nef protein or biologically 
modification thereof, inducing a cellular immune response which specifically recognizes HTV-l. The exemplified polynucleotides 
of the present invention are synthetic DNA molecules encoding HTV-l Gag, encoding codon optimized HTV-l Pol, derivatives of 
optimized HIV-1 Pol (including constructs wherein protease, reverse transcriptase, RNAse H and integrase activity of HTV-l Pol 
is inactivated), HTV-l Nef and derivatives of optimized HTV-l Nef, including nef mutants which effect wild type characteristics of 
Nef, such as myristylation and down regulation of host CD4. The adenoviral vaccines of the present invention, when administered 
alone or in a combined modality regime, will offer a prophylactic advantage to previously uninfected individuals and/or provide 
a therapeutic effect by reducing vixal load levels within an infected individual, thus prolonging the asymptomatic phase of HIV-1 
infection. 
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TTTLE OF TEffi INVENTION 

ENHANCED BDRST GENEEIATEON ADENOVIRUS VACCINES EXPRESSING 
CODON OPTIMIZED HTVl-GAG, POL, NEF AND MODIFICATIONS 

5 CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the benefit, under 35 U.S.C. §119(e), of U.S. 
provisional applications 60/233,180, 60/279,056, and Attorney Docket 20867PV2 
(serial number unassigned), filed September 15, 2000, March 27, 2001, and 
September 7, 2001 , respectively. 

10 

STATEMENT REGARDING FEDERALLY-SPONSORED R&D 
Not Applicable 

REFERENCE TO MICROHCHE APPENDIX 
15 Not Applicable 

FIELD OF THE INVENTION 

The present invention relates to recombinant, replication-deficient first 
generation adenovirus vaccines found to exhibit enhanced growth properties and 

20 greater cellular-mediated mununity as compared to other replication-deficient vectors. 
The invention also relates to the associated first generation adenoviral vectors 
described herein, which, through the incorporation of additional 5' adenovirus 
sequence, enhance large scale production efficiency of the recombinant, replication- 
defective adenovirus described herein. Another aspect of the instant invention is the 

25 surprising discovery that the intron A portion of the human cytomegalovirus (hCMV) 
promoter constitutes a region of instability in adenoviral vector constructs. Removal 
of this region from adenoviral expression constructs results in greatly improved vector 
stability. Therefore, improved vectors expressing a transgene imder the control of an 
intron A-deleted CMV px)moter constitute a further aspect of this invention. These 

30 adenoviral vectors are useful for generating recombinant adenovirus vaccines against 
human immunodeficiency virus (HIV). In particular, the first generation adenovirus 
vectors disclosed herein are utilized to construct and generate adenovirus-based HIV- 
1 vaccines which contain HIV-1 Gag, HIV-1 Pol and/or HIV-1 Nef polynucleotide 
pharmaceutical products, and biologically active modifications thereof. Host 

35 administration of the recombinant, replication-deficient adenovirus vaccines described 
herein results in expression of HTV-l Gag, HTV-l- Pol and/or Nef protein or 
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iiDinunologically lelevant modifications thereof, inducing a cellular immune response 
which specifically recognizes HEV-l. The exemplified polynucleotides of the present 
invention are synthetic DNA molecules encoding codon optimized HIV-1 Gag, HIV-1 
Pol, derivatives of optimized HTV-l Pol (including constructs wherein protease, 
5 reverse transcriptase, RNAse H and integrase activity of HTV-l Pol is inactivated), 
HIV-1 Nef, and derivatives of optimized HTV-l Nef, including nef mutants which 
effect wild type characteristics of Nef, such as myristylation and down regulation of 
host CD4. The HIV adenovirus vaccines of the present invention, when administered 
alone or in a combined modality and/or prime/boost regimen, will offer a prophylactic 
10 advantage to previously uninfected individuals and/or provide a therapeutic effect by 
reducing viral load levels within an infected individual, thus prolonging the 
asymptomatic phase of HEV-l infection. 

BACKGROUND OF THE INVENTION 

15 Human Immunodeficiency Virus- 1 (HTV-l) is tho etiological agent of 

acquired human immune deficiency syndrome (AIDS) and related disorders. HIV-1 
is an RNA virus of the Retroviridae family and exhibits the S'LTR-gag-pol-env- 
LTR 3' organization of all retroviruses. The integrated form of HEV-l, kaown as the 
provirus, is approximately 9.8 Kb in length. Each end of the viral genome contains 

20 flanking sequences known as long terminal repeats (LTRs). The HTV genes encode at 
least nine proteins and are divided into three classes; the major structural proteins 
(Gag, Pol, and Env), the regulatory proteins (Tat and Rev); and the accessory proteins 
(Vpu, Vpr, Vif and Nef). 

The gag gene encodes a 55-kilodalton (kDa) precursor protein (p55) which is 

25 expressed fi:om the unspliced viral mRNA and is proteolytically processed by die HIV 
protease, a product of the po/ gene. The mature p55 protein products are pl7 
(matrix), p24 (capsid), p9 (nucleocapsid) and p6. 

The pal gene encodes proteins necessary for virus replication; a reverse 
transcriptase, a protease, integrase and RNAse H. These viral proteins are expressed 

30 as a Gag-Pol fusion protein, a 160 kDa precursor protein which is generated via a 
ribosomal frame shifting. The viral encoded protease pipteolytically cleaves the Pol 
polypeptide away from the Gag-Pol fusion and further cleaves the Pol polypeptide to 
the mature proteins which provide protease (Pro, PIO), reverse transcriptase (RT, 
P50), integrase (IN, p31) and RNAse H (RNAse, pl5) activities. 
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The ne/gene encodes an early accessory HIV protein (Nef) which has been 
shown to possess several activities such as down regulating CD4 expression, 
disturbing T-cell activation and stimulating HTV infectivity. 

The env gene encodes the viral envelope glycoprotein that is translated as a 
5 160-kilodalton OkDa) precursor (gpl60) and then cleaved by a cellular protease to 
yield the external 120-kDa envelope glycoprotein (gpl20) and the transmembrane 41- 
kDa envelope glycoprotein (gp41). Gpl20 and gp41 remain associated and are 
displayedx)n tiie viral particles and the surface of HIV-infected cells. 

The tat gene encodes a long form and a short form of the Tat protein, a RNA 
10 binding protein which is a transcriptional transactivator essential for HIV-1 
replication. 

The rev gene encodes the 13 kDa Rev protein, a RNA binding protein. The 
Rev protein binds to a region of the viral RNA termed the Rev response element 
(RRE). The Rev protein promotes transfer of unspliced viral RNA from the nucleus 
15 to the cytoplasm. The Rev protein is required for HTV late gene expression and in 

turn, HTV replication. 

Gpl20 binds to the CD4/chemokine receptor present on the surface of helper 
T-lymphocytes, macrophages and other target cells in addition to other co-receptor 
molecules. X4 (macrophage tropic) virus show tropism for CE)4/CXCR4 complexes 

20 while a R5 (T-cell line tropic) virus interacts with a CD4/CCR5 receptor complex. 
After gpl20 binds to CD4, gp41 mediates the fusion event responsible for virus entry. 
The virus fuses with and enters tiie target cell, followed by reverse transcription of its 
sin^e stranded RNA genome into flie double-stranded DNA via a RNA dependent 
DNA polymerase. The viral DNA, known as provirus, enters the cell nucleus, where 

25 the viral DNA directs the production of new viral RNA within the nucleus, expression 
of early and late HIV viral proteins, and subsequently the production and cellular 
release of new virus particles. Recent advances in the ability to detect viral load 
within the host shows that the primary infection results in an extremely high 
generation and tissue distribution of the virus, followed by a steady state level of virus 

30 (albeit throu^ a continual viral production and turnover during this phase), leading 
ultimately to another burst of virus load which leads to the onset of clinical ADDS. 
Productively infected cells have a half life of several days, whereas chronically or 
latentiy infected cells have a 3-week half life, followed by non-productively infected 
cells which have a long half life (over 100 days) but do not significantiy contribute to 

35 day to day viral loads seen throughout tfie course of disease. 
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Destruction of CD4 helper T lymphocytes, which are critical to immune 
defense, is a major cause of the progressive immune dysfunction that is the hallmark 
of mv infection. Hie loss of CD4 T-cells seriously impairs the body's ability to fight 
most invaders, but it has a particularly severe impact on the defenses against viruses, 
5 fungi, parasites and certain bacteria^ including mycobacteria. 

Effective treatment regimens for HIV-1 infected individuals have become 
available recently. However, &ese drugs will not have a significant impact on the 
disease in many parts of the world and they will have a minimal impact in halting the 
spread of infection within the human population. As is true of many other infectious 

10 diseases, a significant qndemiologic impact on the. spread of HIV-l infection will 
only occur subsequent to the development and introduction of ah effective vaccine. 
There are a number of factors ih&t have contributed to the lack of successful vaccine 
development to date. As noted above, it is now apparent that in a chronically infected 
person there exists constant virus production in spite of the presence of anti-HTV-l 

15 humoral and cellular immxme responses and destruction of virally infected cells. As 
in the case of other infectious diseases, the outcome of disease is the result of a 
balance between the kinetics and the magnitude of_the inmiune response and the „ . 
pathogen leplicative rate and accessibility to the immune response. Pre-existing 
immunity may be more successful with an acute infection than an evolving immune 

20 response can be with an established infection. A second factor is the considerable 
genetic variability of the virus. Although anti-HTV-l antibodies exist that can 
oeutrahze EDIY-1 infectivity in cell culture, these antibodies are generally virus 
isolate-specific in their activity. It has proven impossible to define serological 
groupings of HIV-l using traditional methods. Rather, the virus seems to define a 

25 serological '^continuum'" so that individual neutralizing antibody responses, at best, 
are effective against only a handful of viral variants. Given this latter observation, it 
would be useful to identify inununogens and related delivery technologies that are 
likely to elicit anti-HIV-I cellular immune responses. It is known that in order to 
generate CTL responses antigen must be synthesized within or introduced into cells, 

30 subsequently processed into small peptides by the proteasome complex, and 

translocated into the endoplasmic reticuluno/Golgi complex secretory pathway for 
eventual association with major histocompatibility complex (MHC) class I proteins. 
CDS'*' T lymphocytes recognize antigen in association with class I MHC via the T cell 
receptor (TCR) and the 0)8 cell surface protein. Activation of naive CD8"*' T cells 

35 into activated effector or memory cells generally requires both TCR engagement of 
antigen as described above as well as engagement of costimulatory proteins. Optimal 
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induction of CTL responses usually requires "help" in the form of cytokines from 
CD4'*' T lymphocytes which recognize antigen associated with MHC class II 
molecules via TCR and CD4 engagement 

European Patent Applications 0 638 316 (Published February 15, 1995) and 0 
5 586 076 (Published March 9, 1994), (both assigned to American Home Products 
Corporation) describe replicating adenovirus vectors carrying an HIV gene, including 
env or gag. Various treatment regimens were used with chimpanzees and dogs, some 
of which included booster adenovirus or protein plus alum treatments. 

Replication-^fectivi^ adenoviral vectors harboring deletions in the El region 
10 are known, and recent adenoviral vectors have incorporated the known packaging 
repeats into these vectors; e.g., see EP 0 707 071, disclosing, inter cdia^ an adenoviral 
vector deleted of El sequences from base pairs 459 to 3328; and U.S. Patent No. 
6,033,908, disclosing, inter alia, an adenoviral vector deleted of base pairs 459-3510. 
The packaging efficiency of adenovirus has been taught to depend on the number of 
15 incorporated individual A (packaging) repeats; see, e.g., Grable and Hearing, 1990 /. 
ViroL 64(5):2047-2056; Grable and Hearing, 1992 /. Virol 66(2):723-731. 

Larder, et al., (1987, Nature 327: 716-717) and Larder, et al., (1989, Proc. 
Natl Acad. ScL 86: 4803-4807) disclose site specific mutagenesis of HIV-1 RT and 
the effect such changes have on in vitro activity and infectivity related to interaction 
20 with known inhibitors of RT. 

Davies, et al, (1991, Science 252:, 88-95) disclose the crystal structure of the 
RNase H domain of HIV-1 Pol. 

Schatz, et al. (1989, FEBS Lett. 257: 311-314) disclose that mutations 
Glu478Gln and His539Phe in a complete fflV-l RT/RNase H DNA fragment results 
25 in defective RNase activity without effecting RT activity. 

Mizrahi, et al, (1990, Nucl Acids. Res. 18: pp. 5359-5353) disclose additional 
mutations Asp443Asn and Asp498Asn in the RNase region of the pol gene which also 
results in defective RNase activity. The authors note that the Asp498 Asn mutant was 
difficult to characterize due to instability of this mutant protein. 
30 , Leavitt, et al. (1993, /. Biol Chem. 268: 2113-2119) disclose several 

mutations, including a Asp64Val mutation, which show.diffmng effect on HEV-l 
integrase (IN) activity. 

Wiskerchen, et al. (1995, J. ViroL 69: 376-386) disclose singe and double 
mutants, including mutation of aspartic acid residues which effect HTV-l IN and viral 
35 replication functions. 
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It would be of great import in the battle against AIDS to produce a 
prophylactic- and/or therapeutic-based HIV vaccine which generates a strong cellular 
immune response against an HIV infection. The present invention addresses and 
meets these needs by disclosing a class of adenovirus vaccines which, upon host 
5 administration, express codon optimized and modified versions of the HIV-1 genes, 
gag, pol and nef. These recombinant, replication-defective adenovirus vacdnes may 
be administered to a host, such as a human, alone or as part of a combined rdodaUty 
regimen and/or prime-boost vaccination regimen with components of the present 
invention and/or a distinct viral HIV DNA vaccine, non-viral HIV DNA vaccine, HIV 
10 subunit vaccine, an HTV whole killed vaccine and/or a live attenuated HTV vaccine. 

SUMMARY OF TEffi INVENTION 

The present invention relates to enhanced replication-defective recombinant 
adenovirus vaccine vectors and associated recombinant, replication-deficient 

15 adenovirus vaccines which encode various forms of HIV-1 Gag, HIV-l Pol, and/or . 
mV-l Nef, including immuhobgically relevant modifications of HIV-l Gag, HTV-l 
Pol and HIV-1 Nef. The adenovirus vaccines of the present invention express HFV- 
antigens and provide for improved cellular-mediated iimnune responses upon host 
administration. Potential vaccinees include but are not limited to primates and 

20 especially humans and non-human primates, and also include any non-human 
manmial of commercial or domestic veterinary importance. An effect of the 
improved recombinant adenovirus-based vaccines of the present invention should be a 
lower transmission rate to previously uninfected individuals (i.e., prophylactic 
apphcations) and/or reduction in the levels of the viral loads within an infected 

25 individual (i.e., therapeutic applications), so as to prolong the asymptomatic phase of 
HIV-1 infection. In particular, the present invention relates to adenoviral-based 
vaccines which encode various forms of codon optimized HTV-l Gag (including but 
in no way limited to p55 versions of codon optimized fuU length (FL) Gag and tPA- 
Gag fusion proteins), HIV-1 Pol, HIV-l Nef, and selected modifications of 

30 immunological relevance. The administration, intracellular delivery and expression 
of these adenovirus vaccines elicit a host CTL and Th response. The preferred 
replication-defective recombinant adenoviral vaccine vectors include but are not 
limited to synthetic DNA molecules which (1) encode codon optimized versions of 
wild type HTV-l Gag; (2) encode codon optimized versions of HTV-l Pol; (3) encode 

35 codon optimized versions of HTV-l Pol fusion proteins; (4) encode codon optimized 
versions of modified HTV-l Pol proteins and fusion proteins, including but not limited 
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to pal modifications involving residues within the catalytic regions responsible for 
RT, RNase and IN activity within the host cell; (5) encode codon optimized versions 
of wild type HIV-1 Nef; (6) codon optimized versions of HIV-1 Nef fusion proteins; 
and/or (7) codon optimized versions of HIV-1 Nef derivatives, including but not 
5 limited to ne/ modifications involving introduction of an amino-terminal leader 
sequence, removal of an amino-terminal myristylation site and/or introduction of 
dileucine motif mutations. The Nef-based fusion and modified proteins, disclosed 
within this specification and expressed from an adenoviral-based vector vaccine this 
specification, may possess altered trafficking and/or host cell function while retaining 

10 the abihty to be properly presented to the host MHC I complex and in turn eUcit a 
host CTL and Th response. Examples of HIV-1 Gag, Pol and/or Nef fusion proteins 
include but are not limited to fusion of a leader or signal peptide at the NHa-teriminal 
portion of the viral antigen coding region. Such a leader peptide includes but is not 
limited to a tPA leader peptide. 

15 The adenoviral vector utilized in construction of the HTV-l Gag-, HIV-1 Pol- 

and/or HIV-1 Nef- based vaccines of die present invention may comprise any 
replication-defective adenoviral vector which provides for enhanced genetic stability 
of the recombinant adenoviral genome through large scale production and purification 
of the recombinant virus. In other words, an HIV-1 Gag-, Pol- or Nef-based 

20 adenovirus vaccine of the present invention is a purified recombinant, replication- 
defective adenovirus which is shown to be genetically stable through multiple 
passages in cell culture and remains so during large scale production and purification 
procedures. Such a recombinant adenovirus vector and harvested adenovirus vaccine 
lends itself to large scale dose filling and subsequent worldwide distribution 

25 procedures which will be demanded of an efficacious monovalent or multivalent HTV 
vaccine. The present invention meets this basic requirement with description of a 
replication-defective adenoviral vector and vectors derived therefrom, at least 
partially deleted in El, comprising a wildtype adenovirus cw-acting packaging region 
from about base pair 1 to between from about base pair 342 (more preferably, 400) to 

30 about base pair 458 of the wildtype adenovmis genome, A preferred embodhnent of 
the instant invention comprises base pairs 1-450 of a wildtype adenovirus. In other 
preferred embodiments, the replication -defective adenoviral vector has, in addition 
thereto, a region 3' to the El-deleted region comprising base pairs 3511-3523. 
Basepairs 342-450 (more particularly, 400-450) constitute an extension of tiie 

35 5'region of previously disclosed vectors carrying viral antigens, particularly HTV 
antigens (see, e.g., PCT International Application PCT/USOO/18332. published 
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January 11, 2001 (WO 01/02067), which claims priority to U.S. Provisional 
Application Serial Nos. 60/142,631 and 60/148,981, filed 7/6/1999 and 8/13/1999. 
respectively; these documents herein incorporated by reference. Applicants have 
found that extending the 5* region further into the El gene into the disclosed vaccine 
5 vectors incorporated elements found to be important in optimizing the packaging of 
the virus. 

As compared to previous vectors not comprising basepairs from about 1 to 
between £tx)m about base pair 342 (more preferably, 400) to about base pair 458 of the 
wildtype adenovirus genome, vectors comprising the above region exhibited enhanced 

10 growth characteristics, with approximately S-10 fold greater amplification rates, a 
more potent virus effect, allowing lower doses of virus to be used to generate 
equivalent immunity; and a greater cellular-mediated immune response than 
replication-deficient vectors not comprising this region (basepairs 1-450). Even more 
important, adenoviral constructs derived therefrom are very stable genetically in 

15 large-scale production, particularly those comprising an expression cassette under the 
control of a hCMV promoter devoid of intcon A. This is because Applicants have 
surprisingly found that the iDtron A portion of the hCMV promoter constituted a 
region of instability when employed in adenoviral vectors. Applicants have, 
therefore, identified an enhanced adenoviral vector which is particularly suited for use 

20 in gene therapy and nucleotide-based vaccine^ vectors which, favorably, lends itself to 
large scale propagation. 

A preferred embodiment of this invention is a replication-defective adenoviral 
vector in accordance with the above description wherein the gene is inserted in the 
form of a gene expression cassette comprising (a) a nucleic acid encoding a protein or 

25 biologically active and/or immunologically relevant portion thereof; (b) a 

heterologous promote operatively linked to the nucleic acid of part a); and, (c) a 
transcription terminator. 

In preferred embodiments, the El gene, other than that contained within 
basepairs 1-450 or, altematively, that contained within base pairs 1-450 and 3511- 

30 3523 has been deleted from the adenoviral vector, and the gene expression cassette 
. has replaced the deleted El gene. In other preferred embodiments, the replication 
defective adenovirus genome does not have a functional E3 gene, or the E3 gene has 
been deleted. Most preferably, the E3 region is present within the adenoviral genome. 
Further preferred embodiments are wherein the gene expression cassette is in an El 

35 anti-parallel (transcribed in a 3' to 5* direction relative to the vector backbone) 
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orientation or, more preferably, an El parallel (transcribed in a 5* to 3' direction 
relative to the vector backbone) oriratation. 

Further embodiments relate to a shuttle plasmid vector comprising: an 
adenoviral portion and a plasmid portion, wherein said adenovirus portion comprises: 
5 a) a replication defective adenovirus genome, at least partially deleted in El , 

comprising a wildtype adenovirus cw-acting packaging region from about base pair 1 
to between from about base pair 342 (more preferably, 400) to about base pair 458 
(preferably, 1-450) of the wildtype adenovirus genome and, preferably, in addition 
thereto, basepairs 3511-3523 of a wildtype adenovirus sequence; and b) a gene 

10 expression cassette comprising: (a) a nucleic acid encoding a protein or biologically 
active and/or immunologically relevant portion thereof; (b) a heterologous promoter 
operatively linked to the nucleic acid of part a);and (c) a transcription terminator 
and/or a polyadenylation site. 

Oth^ aspects of this invration include a host cell comprising said adenoviral 

15 vectors and/or said shuttle plasmid vectors; vaccine compositions comprising said 
vectors; and methods of producing the vectors comprising (a) introducing the 
adenoviral vector into a host cell which expresses adenoviral El protein, and (b) 
harvesting the resultant adenoviral vectors. 

To this end, the present invention particularly relates to harvested 

20 recombinant, replication defective virus derived from a host cell, such as but not 
limited to 293 cells or PER.C6® cells, including but not limited to harvested virus 
related to any of the MRKAdS vector backbones, with or without an accompanying 
transgene, including but not limited to the HTV-l antigens described herein. An 
HIV-l vaccine is represented by any harvested, recombinant adenovirus material 

25 which expresses any one or more of the HTV-l antigens disclosed herein. This 
harvested material may then be purified, formulated and stored prior to host 
administration. 

Another aspect of this invention is a method of generating a cellular immune 
response against a protein in an individual comprising administering to the individual 
30 an adenovirus vaccine vector comprising: 

a) a recombinant, replication defective adenoviral vector, at least partially 
deleted in El, comprising a wildtype adenovirus cw-acting adenovirus packaging 
region bom about base pair 1 to between from about base pair 342 (more preferably, 
400) to about base pair 458 (preferably, 1-450) and, preferably in addition thereto, 
35 base pairs 351 1-3523 of a wildtype adenovirus sequence, and. 
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b) a gene expression cassette compiising:(i) a nucleic acid oicoding a protein 
or biologically active and/or inimunologically relevant portion thereof; (ii) a 
heterologous promoter operatively linked to the nucleic acid of part a); and (iii) a 
transcription t^minator and/or a polyadenylation site. 
5 In view of the efficacious nature of the adenoviral and/or DNA plasmid 

vaccines described herein, the present invention relates to all methodology regarding 
administration of one or more of these adenoviral and/or DNA plasmid vaccines to 
provide effective immunoprophylaxis, to prevent establishment of an HIV-1 infection 
following exposure to this virus, or as a post-HIV infection therapeutic vaccine to 

10 mitigate the acute HlV-l infection so as to result in the establishment of a lower virus 
load with beneficial long term consequences. As discussed herein, such a treatment 
regimen may include a monovalent or multivalent composition, various combined 
modality applications, and/or a prime/boost regimen to as to optimize antigen 
expression and a concomitant cellular-mediated and/or humoral immune response 

15 upon inoculation into a living vertebrate tissue. Therefore, the present invention 

provides for methods of using the adenoviral and/or DNA plasmid vaccines disclosed 
herein within the various parameters disclosed herein as well as any additional - 
parameters known in the art, which, upon introduction into mammalian tissue induces 
intracellular expression of the gag, pol and/or nef-based vaccines. 

20 To this end, the present invention relates in part to methods of generating a 

cellular immune response in a vaccinee, preferably a hiunan vaccinee, wherein the 
individual is given more than one administration of adenovirus vaccine vector, and it 
may be given in a regimen accompanied by the adnoinistration of a plasmid vaccine. 
The plasmid vaccine (also referred to herein as a "DNA plasmid vaccine" or "vaccine 

25 plasmid" comprises a nucleic acid encoding a protein or an immunolpgically relevant 
poition thereof, a heterologous promoter operably linked to the nucleic add sequence, 
and a transcription terminator or a polyadenylation signal (such as bGH or SPA, 

4 

respectively). There may be a predetermined minimum amount of time separating the 
administrations. The individual can be given a first dose of plasmid vaccine, and then 

30 a second dose of plasmid vaccine. Alternatively, the individual may be given a first 
dose of adenovirus vaccine, and then a second dose of adenovirus vaccine. In other 
embodiments, the plasnoid vaccine is administered first, followed after a time by 
administration of the adenovirus vaccine. Conversely, the adenovirus vaccine may be 
administered first, followed by administration of plasmid vaccine after a time. In 

35 these embodiments, an individual may be given multiple doses of the same 

adenovirus serotype in either viral vector or plasmid form, or the virus may be of . ■ 
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differing serotypes. In the alternative, a viral antigen of interest can be first delivered 
via a viral vaccine other than an adenovims-based vaccine, and then followed with the 
adenoviral vaccine disclosed. Alternative viral vaccines include but are not limited to 
pox vims and Venezuelan equine encephilitis virus. 
5 The present invention also relates to multivalent adenovirus vaccine 

compositions which comprise Gag, Pol and Nef components described herein; see, 
e.g.. Example 29 and Table 25. Such compositions will provide for an enhanced 
cellular immune response subsequent to host administration, particularly given the 
genetic diversity of human MHCs and of circulating virus. Examples, but not 

10 limitations, include MRKAdS-vector based multivalent vaccine compositions which 
provide for a divalcait (i.e., gag and nef, gag and pol, or pol and nef components) or a 
trivalent vaccine (i.e., gag, pol and nef components) composition. Such a mutlivalent 
vaccine may be filled for a single dose or may consist of multiple inoculations of each 
individually filled component; and may in addition be part of a prime/boost regimen 

15 with viral or non-viral vector vaccines as introduced in the previous paragraph. To 
this end, preferred compositions are MRKAd5 adenovirus used in combination with 
multiple, distinct HIV antigen classes. Each HIV antigen class is subject to sequence 
manipulation, thus providing for a multitude of potential vaccine combinations; and 
such combinations are within the scope of the present invention. The utilization of 

20 such combined modalities vaccine formulation and administration increase the 

probability of eliciting an even more potent cellular immune response when compared 
to inoculation with a single modality regimen. 

The concept of a "combined modality" as disclosed herein also covers the 
alternative mode of administration whereby multiple HIV-1 viral antigens may be 

25 ligated into a proper shutde plasmid for generation of a pre-adenoviral plasmid 
comprising multiple open reading frames. For example, a trivalent vector may 
comprise a gag-pol-nef fusion, in either a E3(-) or E3(+) background, preferably a E3 
deleted backbone, or possibly a "2+1" divalent vaccine, such as a gag-pol fusion (i.e., 
codon optimized p55 gag and inactivated optimized pol; Example 29 and Table 25) 

30 within the same MRKAdS backbone, with each open reading firame being operatively 
linked to a distinct promoter and transcription termination sequence. Alternatively, the 
two open reading frames may be operatively linked to a single promoter, with the 
open reading fiames operatively linked by an internal ribosome entry sequence 
(IRES). Therefore, a multivalent vaccine delivered as a single, or possibly a second 

35 harvested recombinant, replication-deficient adenovirus is contemplated as part of the 
present invention. 



11 



wo 02/22080 



PCT/USOl/28861 



Therefore, the admoviral vaccines and plasmid DNA vaccines of this 
invention may be administered alone, or may be part of a prime and boost 
administration regimen. A mixed modality piiming and booster inoculation scheme 
will result in an enhanced immune response, particularly if pre-existing anti-vector 
5 immune responses are present This one aspect of this invention is a method of 
priming a subject with the plasmid vaccine by adnodnistering the plasnodd vaccine at 
least one time, allowing a predet^mined length of time to pass, and then boosting by 
administering the adenoviral vaccine. Multiple primings typically, 1-4, are usually 
employed, although more may be used. The length of time between priming and 

10 boost may typically vary fix)m about four months to a year, but other time frames may 
be used. In experiments with rhesus monkeys, the animals were primed four times 
with plasmid vaccines, then were boosted 4 months later with the adenoviral vaccine* 
Their cellular immune response was notably higher than that of animals which had 
only received adenoviral vaccine. The use of a priming regimen may be particularly 

15 preferred in situations where a person has a pre-existing anti-adenovirus immune 
response. ' 

It is an object of the present invention to provide for enhanced 
replication-defective recombinant adenoviral vaccine vector backbones. These 
recombinant adenoviral backbones may accept one or more transgenes, which may be 

20 passaged through ceU culture for growth, amplification and harvest 

It is a further object to provide for enhanced replication-defective recombinant 
adenoviral vaccine vectors which encode various transgenes. 

It is also an object of the present invention to provide for a harvested 
recombinant, replication-deficient adenovirus which shows enhanced growth and 

25 amplification rates while in combination with increased virus stability after 

continuous passage in cell culture. Such a recombinant adenovirus is particularly 
suited for use in gene therapy and nucleotide-based vaccine vectors which, favorably, 
lends itself to large scale propagation. 

To this end, it is an object of the present invention to provide for (1) enhanced 

30 replication-defective recombinant adenoviral vaccine vectors as described herein 
. which encode various forms of HIV-1 Gag, HIV-l Pol, and/or HIV-1 Nef, including 
immunologically relevant modifications of HIV-1 Gag, HIV-1 Pol and HTV-l Nef, 
and (2) harvested, purified recombinant replication-deficient adenovirus generated by 
passage of the adenoviral vectors of (1) through one or multiple passages through cell 

35 culture, including but not limited to passage through 293 cells or PER.C6® cells. 
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It is also an object of the present invention to provide for recombinant 
adenovirus harvested by one or multiple passages through cell culture. As relating to 
recombinant adenoviral vaccine vector, this recombinant virus is harvested and 
formulated for subsequent host administration. 
5 It is also an object of the present invention to provide for replication-defective 

adenoviral vectors wherein at least one gene is inserted in the form of a gene 
expression cassette comprising (a) a nucleic acid encoding a protein or biologically 
active and/or inmmnologically relevant portion thereof; (b) a heterologous promoter 
operatively linked to the nucleic acid of part a); and, (c) a transcription terminator. 

10 It is also an object of the present invention to provide for a host cell 

comprising said adenoviral vectors and/or said shuttle plasmid vectors; vaccine 
compositions comprising said vectors; and methods of producing the vectors 
comprising (a) introducing the adenoviral vector into a host cell which expresses 
adenoviral El protein, and (b) harvesting the resultant adenoviral vectors. 

15 It is a further object of the present invention to provide for methods of generating a 
cellular immune response against a protein in an individual comprising administering 
to the individual an adenovirus vaccine vector comprising a) a replication defective 
adenoviral vector, ai least partially deleted in El, comprising a wildtype adenovirus 
d^-acting packaging region fiom about base pair 1 to between from about base^ pair 

20 342 (more preferably, 400) to about 450 (preferably, 1-450) and, preferably, 3511- 
3523 of a wildtype adenovirus sequence, and, b) a gene expression cassette 
comprising:(i) a nucleic acid encoding a protein or biologically active and/or 
immunologically relevant portion thereof; (ii) a heterologous promoter operatively 
linked to the nucleic acid of part a); and (iii) a transcription terminator and/or a 

25 polyadenylation site. 

It is also an object of the present invention to provide various alternatives for 
vaccine administration regimes, namely adnainistration of one or more adenoviral 
and/or DNA plasmid vaccines described herein to provide effective 
immunoprophylaxis for uninfected individuals or a therapeutic treatment for HTV 

30 infected patients. Such processes include but are not limited to multivalent HIV-1 
vaccine compositions, various combined modality regimes as well as various . 
prime/boost alternatives. These methods of administration, relating to vaccine 
composition and/or scheduled administration, will increase the probability of eliciting 
an even more potent cellular immune response when compared to inoculation with a 

35 single modality regimen. 
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As used throughout the specification and claims, the following definitions and 
abbreviations are used: 

"HAART" refers to — highly active antibretroviral therapy — . 

"first generation*' vectors are characterized as being replication-defective. 
5 They typically have a deleted or inactivated El gene region, and preferably have a 
deleted or inactivated E3 gene region as well. 

"AEX" refers to Anion Exchange chromatography. 

"QPA" refers to Quick PCR-based Potency Assay. 

**bps" refers to basepairs. 
10 "s" or "str", denotes that the transgene is in the El parallel or "straight" 

orientation. 

*TBMCs" refers to peripheral blood monocyte cells. 
"FL" refers to full length. 

"FLgag" refers to a full-length optimized gag gene, as shown in Figure 2. 
15 "Ad5-Flgag" refers to an adenovirus serotype 5 replication deficient virus 

which catties an expression cassette which comprises a full length optimized gag gene 
under the control of a CMV promoter. 

"Promoter" means a recognition site on a DNA strand to which an RNA 
polymerase binds. The promoter forms an initiation complex with RNA polymerase 
20 to initiate and drive transcriptional activity. The complex can be modified by 
activating sequences such as enhancers or inhibiting sequences such as silencers. 

"Leader" means a DNA sequence at the S' end of a stmctural gene which is 
transcribed along with the gene. This usually results a protein having an N-terminal 
peptide extension, often referred to as a pro-sequences. 
25 "Intron" means a section of DNA occurring in the middle of a gene which 

does not code for an amino acid in the gene product The precursor RNA of the intron 
is excised and is therefore not transcribed into mRNA not translated into protein. 

'Immunologically relevant" or **biologically active" means (1) with regards to 
a viral protein, that the protein is capable, upon administration, of eliciting a 
30 measurable immune response within an individual sufficient to retard the propagation 
and/or spread of the virus and/or to reduce the viral load present within the individual; 
or (2) with regards to a nucleotide sequence, that the sequence is capable of encoding 
for a protein capable of the above. 

"Cassette" refers to a nucleic acid sequence which is to be expressed, along 
35 with its transcription and translational control sequences. By changing the cassette, a 
vector can express a different sequence. 
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'1>GHpA'* refers to the bovine growth hormone transcription 
terminator/polyadenylation sequence. 

"tPAgag" refers to a fusion between the leader sequence of the tissue 
plasnodnogen activator leader sequence and an optimized HIV gag gene, as 
5 exemplified in Figure 30A-B, whether in a DNA or adenovirus-based vaccine vector. 

Where utilized, "lA" or "inact" refers to an inactivated version of a gene (e.g. 

lApol). 

"MCS" is "multiple cloning site". 

In general, adenoviral constructs, gene constructs are named by reference to 
10 the genes contained therein. For example: 

"Ad5 HTV-l gag", also referred to as the original HTV-l gag adenoviral 
vector, is a vector containing a transgene cassette composed of a hCMV intron A 
promoter, the full length version of the human codon-optimized HIV-1 gag gene, and 
the bovine growth hormone polyadenylation signal. The transgene was inserted in the 
15 El antiparallel orientation in an El and E3 deleted adenovector. 

^MRK Ad5 HIV-1 gag" also tefeired to as "MRKAdSgag" or "Ad5gag2" is 
an adenoviral vector taught herein which is deleted of El, comprises basepairs 1-450 
and 3511-3523, and has a human codon-optimized HIV-1 gene in an El parallel 
orientation imder tfie control of a CMV promoter without intron A. The construct 
20 also comprises a bovine growth hormone polyadenylation signal. 

"pVl JnsHIVgag^ also lefenred to as "HIVFLgagPR9901^ is a p 
comprising the CMV immediate-early (JE) promoter and intron A, a full-length 
codon-optimized HIV gag gene, a bovine growth hormone-derived polyadenylation 
and transcriptional termination sequence, and a minimal pUC backbone. 
25 "pVl JnsCMV(no intron>FLgag-bGHpA" is a plasmid derived fix>m 

pVl JnsHIVgag which is deleted of the intron A portion of CMV and which comprises 
the full length HTV gag gene. This plasmid is also referred to as "pVl JnsHIVgag- 
bOHpA", pVl Jns-hCMV-FL-gag-bGHpA" and "pVlJnsCMV(no intron) + FLgag + 
bGHpA''. 

30 "pVlJnsCMV(no intron)-FLgag-SPA" is a plasmid of the same composition 

as pVl JnsCMV(nQ intron>-FLgag-bGHpA except that the SPA termination sequence 
replaces that of bGHpA. This plasmid is also referred to as "pVl Jns-HTVgag-SPA'* 
and pVl Jns-hCMV-FLgag-SPA". 

"pdelElsplA" is a universal shuttle vector with no expression cassette (i.e., no 

35 promoter or polyA). The vector comprises wildtype adenovirus serotype 5 (Ad5) 
sequences from bp 1 to bp 341 and bp 3524 to bp 5798, and has a multiple cloning 
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site between the Ad5 sequences ending 341 bp and beginning 3524 bp. Hus plasmid 
is also ref&cied to as the original Ad 5 shuttle vector. 
**MRKpdelElsplA" or 'MRKpdelEl(Pac/pIX7pack450r or 
''MRKpdeIEl(Pac/pIX/pack4S0)Clar' is a universal shuttle vector with no expression 
5 cassette (i.e. no promoter or poly A) comprising wildtype adenovirus serotype 5 (Ad5) 
sequences from bpl to bp450 and bp 3511 to bp 5798. The vector has a multiple 
cloning site between the Ad5 sequence ending 450 bp and beginning 3511 bp. This 
shuttle vector may be used to insert the CMV promoter and the bGHpA fragments in 
both the straight ("str". or El parallel) orientation or in the opposite (opp. or El 

10 antiparallel) orientation) 

"MElKpdelEl(Pac/pIX/pack450)4Cl!^min+BGHp is still another 

shuttle vector which is the modified vector tiiat contains the CMV promoter (no 
intron A) and the bGHpA fragments. The expression unit containing the hCMV 
promoter (no intron A) and the bovine growth hormone polyadenylation signal has 

15 been inserted into the shuttle vector such that insertion of the gene of choice at a 

unique BgTH site will ensure the direction of transcription of the transgene will be Ad5 
El parallel when inserted into the MRKpAd5(El/E3+)aal pre-plasmid. This shuttle 
vector, as shown in Figures 22 and 23, was used to insert the respective lApol and 
G2A,LLAA nef genes directly into. 

20 'TV[RKpdelEl-CMV(no intron)-FLgag-bGHpA" is a shuttle comprising Ad5 

sequences from basepairs 1-450 and 351 1-5798, with an expression cassette 
containing human CMV without intron A, the full-length human codon-optimized 
HIV gag gene and bovine growth hormone polyadenylation signal. This plasmid is 
also referred to as "MRKpdelEl shutfle +hCMV-PL-gag-BGHpA" 

25 "MElKpAdHVE3+CMV(no intn>n)-FLgag-bGHpA" is an adenoviral vector 

comprising all Ad5 sequences except those nucleotides encompassing the El region 
(from 451-3510), a human CMV promoter without intron A, a full-length human 
codon-optimized HTV gag gene, and a bovine growth hormone polyadenylation 
signal. This vector is also referred to as ''MRKpAdHVE3 + hCMV-FL-gag- 

30 BGHpA", "MRKpAd5HIV-lgag", 'MElKpAdSgag". "pMRKAd5gag" or 
"pAd5gag2';, 

"pVl Jns-mV-pol inact(opt)" or "pVlJns-HEV lA pol (opt) is the inactivated 
Pol gene (contained within SEQ ID NO:3) cloned into the BgUI site of VlJns (Figure 
17A-C). As noted herein, various derivatives of HIV-l pol may be cloned into a 
35 plasmid expression vector such as VI Jns or VlJns-tPA, thus serving directly as DNA 
vaccine candidates or as a source for subcloning into an appropriate adenoviral vector. 
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"MRKpdel+hCMVmin4-FL-.pol+bGHpA(s)" is the 
"NIRKpdelEl(Pac/p]X/pack450)+CMVmin+BGI^ shuttle mentioned above 

which contains the lA pol gene is the proper orientation. This shuttle vector is used in 
a bacterial recombination with MRKp Ad(El -/E3+)Clal . 
5 "MIlKpAd+hCMVmin+FL-pol+bGHpA(S)E3+", also referred to herein as 

"pMRKAdSpor', is the pre-adenovirus plasmid which comprises a CMV -pol 
inact(opt)-pGHpA construct. The construction of this pre-adenovirus plasmid is 
shown in Figure 22. 

"pVl Jns/nef (G2A,LLAA)" or "VlJns/opt nef (G2AJJLAA)" comprises 

10 codon optimized HTV-l Nef wherein the open reading frame codes for modifications 
at the amino tenninal myristylation site (Gly-2 to Ala-2) and substitution of the Leu- 
174-Leu-175 dileucine motif to Ala-174"Ala-175 (SEQ ID NO: 13; which comprises 
an initiating methionine residue at nucleotides 12-14 and a "TAA" stop codon from 
nucleotides 660-662), ITiis fragment is subcloned into the Bgl n site of VI Jns 

15 and/orVlJns-tPA (Figures 16A-B). As noted above for HIV-1 pol, HIV-l nef 

constructs may be cloned into a plasmid expression vector such as VI Jns or VI Jns- 
tPA, thus serving directly as DNA vaccine candidates or as a source for subcloning 
into an appropriate adenoviral vector. 

"MRKpdemihCMVminFL-nefBGHpA(s)", also refened to herein as 

20 "pMRKAdSnef \ is the pre-adenovirus plasmid which comprises a CMV-nef 

(G2A,LLAA) codon optimized sequence. The construction of this pre-adenovirus 
plasmid is shown in Figure 23. 

BRIEF DESCRIPTION OF THE FIGURES 

25 Figure 1 shows the original HTV-l gag adenovector (AdSHTV-lgag). This 

vector is disclosed in PCT Intemational AppUcation No, PCT/USOO/18332 (WO 
01/02607) filed July 3, 2000, claiming priority to U.S. Provisional Application Serial 
No. 60/142,631, filed July 6, 1999 and U.S. Application Serial No. 60/148,981, filed 
August 13, 1999, all three applications which are hereby incorporated by reference. 

30 Figure 2 shows the nucleic acid sequence (SEQ ID NO: 29) of the optimized 

human HTV-l gag open reading frame. 

Figure 3 shows diagrammatically the new transgene constructs in comparison 
wUfa the original gag transgene. 

Hgure 4 shows the modifications made to the original adenovector backbone 

35 in the generation of the novel vectors of the instant invention. 
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Rguie 5 shows the vims mixing experiments fliat were canied out to 
determine the effects of the addition made to the packaging signal region (Expt #1) 
and the E3 gene on viral growth C^pt #2). The bars denote the region of 
modifications made to the El deletion; 
5 Hgure 6 shows an autoradiograph of viral DNA analysis following the viral 

mixing experiments described in Examples 6 and 7. 

Figures 7A, 7B and 7C are as follows: Figure 7A shows the hCMV-Flgag- 
bGHpA adenovectors constructed within the MRKpAdHVES and MRKpAdHVO 
adenovector backbones. Both El parallel and El antiparallel transgene orientation are 
10 represented. Figure 7B shows the hCMV-Flgag-SPA adenovectors constructed within 
the MRKpAdHVES and MRKpAdHVO adenovector backbones. Again, both El 
parallel and El antiparallel transgene orientation are represented. Figure 7C shows 
the mCMV-Flgag-bGHpA adenovectors constructed within the MRKpAdHVES and 
MRKpAdHVO adenovector backbones. Once again, both El parallel and El 
15 antiparallel transgene orientation are represented. 

Figure 8 A shows the experiment designed to test the effect of transgene 
orientation. - v^- - 

Figure SB shows the experiments designed to test the effect of 
polyadenylation signal. 
20 Figure 9 shows viral DNA from the four adenoviral vectors tested (Example 

12) at P5, following BstEU digestion. 

Figure 10 shows viral DNA analysis of passages 1 1 and 12 of 
MRKpAdHVES, MRKAdSHTV-lgag, and MRKAd5HIV-lgagE3-. 

Figure 1 1 shows viral DNA analysis {HindSi digestion) of passage 6 
25 MRKpAdHVES and MRKAdSHTV-l gag used to initiate flie viral competition study. 
Hie last two lanes ate passage 11 analysis of duplicate passages of the competition 
study (each virus at MOI of 280 viral particles). 

Figure 12 shows viral DNA analysis by Hind m digestion on high passage 
nxunbers for MRKAdSHTV-lgag in senm-containing media with collections made at 
SO specified times. The first lane shows the Ikb DNA size marker. The other lanes 
represent pre-plasmid control (digested with Pacl and Hindni), MRKAdSHTV-lgag 
atP16,P19.andP21. 

Figure 13 shows serum anti-p24 levels at 3 wks post i.m. immunization of 
balb/c mice (n=10) with varying doses of several Adgag constructs: (A) MRK Ad5 
35 HIV-1 gag (through passage 5); (B) MRKAdS hCMV-FLgag-bGHpA (E3-); (C) 
MRKAdS hCMV-FLgag-SPA (E3+); (D) MRKAdS mCMV-ELgag-bGHpA (E3+); 
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(E) research lot (293 cell-derived) of AdSHIV-l gag; and (F) clinical lot 
(Ad5gagFN0001) of AdSHTV-l gag. Reported are the geometric mean titers (GMT) 
for each cohort along with the standard error bars. 

Figure 14 shows a restriction map of the pMRKAd5HIV-lgag vector. 
5 Figures 15A-X illustrates the nucleotide sequence of the pMRKAdSHTV-lgag 

vector (SEQ ID NO:27.[coding] and SEQ ID NO:28 [non-coding]). 

Figures 16A-B shows a schematic representation of DNA vaccine expression 
vectors VlJns (A) and VI Jns-fPA OB), which are utilized for HIV-1 gag, pol and nef 

'I 

constructs in various DNA/viral vector combined modality regimens as disclosed 
10 herein. 

Rgures 17A-C shows the nucleotide (SEQ ID NO:3) and amino acid sequence 
(SEQ ID NO:4) of lA-PoL Underlined codons and amino acids denote mutations, as 
listed in Table 1. 

Ingure 18 shows codoii optimized nucleotide and amino add sequences 
1 5 ftrough the fusion junction of tP A-pol inact(opt) (contained within SEQ ID NOs: 7 
and 8, respectively). The underlined portion represents the NH2-terminaI region of 
lA-Pol. 

Figures 19A-B show a nucleotide sequence comparison between wild type 
nef (jrfl) and codon optimized nef. The wild type nef gene from the jrfl isolate 
20 consists of 648 nucleotides capable of encoding a 216 amino acid polypeptide. WT, 
wild type sequence (SEQ ID NO: 19); opt, codon-optimized sequence (contained 
within SEQ ID NO:l). The Nef amino acid sequence is shown in one-letter code 
(SEQ ID N0:2). 

Figures 20A-C show nucleotide sequences at junctions between nef coding 
25 sequence and plasmid backbone of nef expression vectors VlJns/nef (Figure 20A), 

VlJns/nef(G2A,LLAA) (Figure 20B), VlJns/tpanef (Figure 20C) and 

VlJns/tpanef(LLAA) (Figure 20C, also). 5* and 3 'flanking sequences of codori 

optimized nef or codon optimized nef mutant genes are indicated by bold/italic letters; 

nef and nef mutant coding sequences are indicated by plain letters. Also in(iicated (as 
30 underlined) are the restriction endonuclease sites involved in construction of 

respective nef expression vectors, VlJns/tpanef and VlJns/tpanef(LLAA) have 

identical sequences at the junctions. 

Figure 21 shows a schematic presentation of nef and nef derivatives. Amino 

acid residues involved in Nef derivatives are presented. Glycine 2 and Leucinel74 
35 and 175 are the sites involved in myristylation and dileucine motif, respectively. For 

both versions of the tpanef fusion genes, the putative leader peptide cleavage sites are 
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indicated with and a exogenous serine residue introduced during the construction 
of the mutants is underlined. 

Figure 22 shows diagrammatically the construction of the pre-adenovirus 
plasmid construct, MRKAdSPol. 
S Figure 23 shows diagramioatically the construction of the pre-adenovirus 

plasmid construct, MRKAdSNef . 

Figure 24 shows a comparison of clade B vs. clade C anti-gag T cell responses 
in clade B HIV-infected subjects. 

Figure 25 shows a comparison of clade B vs. clade C anti-nef T cell responses 
10 in clade B HIV-infected subjects. 

Figures 26 A-AO illustrates flie nucleotide sequence of the pMRKAdSHIV- 
Ipol adenoviral vector (SEQ ID NO:32 [coding] and SEQ ID NO:33 [non-coding]), 
comprising the coding region of the inactivated pol gene (SEQ ID N03). 

Figures 27A-AM illustrates the nucleotide sequence of the pMRKAdSHTV-l 
15 nef adenoviral vector (SEQ ID NO:34 [coding] and SEQ ID NO:35 [non-coding]), 
comprising the coding region of the inactivated pol gene (SEQ ID NO 13). 

Figure 28 shows the stability of MRKAd5 vectors comprising various 
promoter fragments (hCMV or mCMV) and terminations signals (bGH or SPA) in 
E3(+) or E3(-) backbones. 
20 Figures 29A and B shows the anion-exchange HPLC viral particle 

concentrations of the freeze-thaw recovered cell associated virus at the 24, 36, 48, and 
60 hpi time points (Figure 29 A) and the timcourse QPA supematant titers (Figure 
29B) for MRKAd5gag, MRKAdSpol and M{lKAd5nef . 

Figure 30 shows the nucleotide sequence (SEQ ID NO:36) and amino acid 
25 sequence (SEQ ID NO:37) comprising the open reading frame of a representative 
tPA-gag fusion for use in the DNA and/or adenoviral vacdne disclosed herein. 

Figure 3 1 shows the intracellular yJFN staining of PBMCs collected at week 
10 (post DNA prime) and week 30 (post Ad boost). The cells were stimulated 
overnight in the presence or absence of the gag peptide pool. They were subsequently 
30 stained using fluorescence-tagged anti-CD3, anti-CD8, anti-CD4, and anti-ylFN 

monoclonal antibodies. Each plot shows aD CD3+ T cells which were segregated in 
terms of positive staining for surface CDS and ylFN production. The numbacs in the 
upper right and lower right quadrants of each plot are the percentages of CD3^ cells 
that were CD8"^YlH^ and OM^YlFN**, respecti^^ 
35 Figure 32 shows a comparison of single-modality adenovirus immunization 

with DNA + adjuvant prime/adenovirus boost immunization. 
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Figures 33A-B show the nucleotide sequence (SEQ ID NO: 38) of the open 
reading frame for the gag-IApol fusion of Example 29. 

Figures 34A-B show the protein sequence (SEQ ID NO:39) of the gag-IApol 
fastion frame. 

5 

DETAIUBD DESCRiraON OF THE INVENTION 

A novel replication-defective, or "jHrst generation," adenoviral vector suitable 
for use in gene therapy or nucleotide-based vaccine vectors is described. This vector 
is at least partially deleted in El and comprises a wildtype adenovirus ciVacting 

10 packaging region from about base pair 1 to between about base pair 342 (more 
preferably, 400) to about 458 (preferably, 1-450) and. preferably, 3511-3523 of a 
wild-type adenovirus sequence. It has been found that a vector of this description 
possesses enhanced growth characteristics, with approximately 5-10 fold greater 
amplification rates, and is more potent allowing lower doses of virus to be used to 

15 generate equivalent immunity. The vector, furthermore, generates a harvested 
recombinant adenovirus which shows greats cellular-mediated inmiune responses 
than replication-deficient vectors not comprising this region (basepairs 342-450). 
Adenoviral constructs derived from these vectors are, further, very stable genetically, 
particularly those comprising a transgene under the control of a hCMV promoter 

20 devoid of intron A. Viruses in accordance with this description were passaged 

continually and analyzed; see Example 12. Each virus analyzed maintained it correct 
genetic structure. Analysis was also carried out under propagation conditions similar 
to that performed in large scale production. Again, the vectors were found to possess 
enhanced genetic stability; see Figure 12. Following 21 passages, the viral DNA 

25 showed no evidence of rearrangement, and was highly reproducible from one 
production lot to the next The outcome of all relevant tests indicate that the 
adenoviral vector is extremely well suited for large-scale production of recombinant, 
replication-deficient adenovirus, as shown herein with the data associated with Figure 
28. 

30 A preferred adenoviral vector in accordance with this description is a vector 

comprising basepairs 1-450, which is deleted in E3. This vector can accommodate up 
to approximately 7,500 base pairs of foreign DNA inserts (or exogenous genetic 
material). Another preferred vector is one retaining E3 which comprises basepairs 1- 
450. A preferred vector of this description is an E3+ vector comprising basepairs 1- 

35 450 and 3511-3523. This vector, when deleted of the region spanning basepairs 451- 
3510, can accommodate up to approximately, 4,850 base pairs of foreign DNA inserts 

21 
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(or exogenous genetic material). The cloning capacities of the above vectors have 
been deteimined using 105% of the wUdtype Ad5 sequence as the upper genome size 
limit 

Wildtype adenovirus serotype S is used as the basis for the specific basepair 
5 numbers provided throughout the specification. The wildtype adenovirus serotype 5 
sequence is known and described in the art; see^ Chroboczek et al., 1992 7. Virology 
186:280, which is hereby incorporated by reference. Accordingly, a particular 
embodiment of the instant invention is a vector based on the adenovirus serotype 5 
sequence. One of skill in the art can readily identify the above regions in other 

10 adenovirus serotypes (e.g., serotypes 2, 4, 6, 12, 16, 17, 24, 31, 33, and 42), regions 
defined by basepairs coiresponding to the above basepair positions given for 
adenovirus serotype 5. Accordingly, the instant invention encompasses all adenoviral 
vectors partially deleted in El comprising basepairs corresponding to 1-450 
(particularly, 342-450) and, preferably, 3511-3523 of a wild-type adenovirus serotype 

15 5 (Ad5) nucleic acid sequence. Particularly preferred embodiments of the instant 
invention are those derived from adenoviruses like Ad5 which are classified in 
- subgroup C (e.g., Ad2). - 

Vectors in accordance with the instant invention are at least partially deleted in 
El. Preferably the El region is completely deleted or inactivated. Most preferably, 

20 the region deleted of El is widiin basepairs 451-3510. It is to be noted that the 

extended 5' and 3' regions of the disclosed vectors are beheved to effectively reduce 
the size of the El deletion of previous constructs without overlapping any part of the 
EIA/EIB gene present in the cell line used, i.e., the PER.C6® cell line transefected 
with base pairs 459-35 10. Overlap of adenoviral sequences is avoided because of the 

25 possibility of recombination. One of ordinary skill in the art can certainly appreciate 
that the instant invention can, therefore, be modified if a different cell line transfected 
with a different segment of adenovirus DNA is utilized. For purposes of 
exemplification, a 5' region of base pairs 1 to up to 449 is more appropriate if a cell 
line is transfected with adenoviral sequence from base pairs 450-3510. This holds 

30 true as well in the consideration of segments 3' to the El deletion. 

Preferred embodiments of the instant invention possess an intact £3 region 
(i.e., an E3 gene capable of encoding a functional E3). Alternate embodiments have a 
partially deleted E3, an inactivated E3 region, or a sequence completely deleted of E3. 
Applicants have found, in accordance with the instant invention, that virus comprising 

35 the E3 gene were able to amplify more rapidly compared with virus not comprising an 
E3 gene; see Figure 6 wherein a diagnostic CsCl band corresponding to the E3+ virus 
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tested (5,665 bp) was present in greater amount compared with the diagnostic band of 
3,010 bp corresponding to the E3- virus. These results were obtained following a 
virus competition study involving mixing equal MOI ratio (1:1) of adenovectors both 
comprising the E3 gene and not cocoprising the E3 gene. This increased amplificadon 
5 capacity of the E3+ adenovectors was subsequently confirmed with growth studies; 
see Table 4A, wherein the E3+ virus exhibit amplification ratios of 470, 420 and 320 
as compared with the 1 15 and 40-50 of the E3- constructs. 

As stated above, vectors in accordance with the instant invention can 
accommodate up to approximately 4,850 base pairs of exogenous genetic material for 

10 an E3+ vector and approximately 7,500 base pairs for an E3- vector. Preferably, the 
insert brings the adenoviral vector as close as possible to a wild-type genomic size 
(e.g., for Ad5, 35,935 basepairs). It is well known that adenovirus amplifies best 
when they are close to their wild-type genomic size. 

The genetic material can be inserted in an El-paraUel or an El anti-parallel 

15 orientation, as such is illustrated in Figure 7 A, 7B, 7C and Figure 8 A. Particularly 
preferred embodiments of the instant invention, have the insert in an El-parallel 
orientation. Applicants have found, via competition experiments with plasmids 
containing transgenes in differing orientation (Figure 8A), that vector constructs with 
the foreign DNA insert in an El-parallel orientation amplify better and actually out- 

20 compete El-antiparallel-oriented transgenes. Viral DNA analysis of the mixtures at 
passage 3 and certainly at passage 6, showed a greater ratio of the virus carrying the 
transgene in the El parallel orientation as compared with the El anti-parallel version. 
By passage 10,, the only viral species observed was the adenovector with the transgene 
in the El parallel orientation for both transgenes tested. 

25 Adenoviral vectors in accordance with the instant invention are particularly 

well suited to effectuate expression of desired proteins, one example of which is an 
HIV protein, particularly an HIV full length gag protein. Exogenous genetic material 
encoding a protein of interest can exist in the form of an expression cassette. A gene 
expression cassette preferably comprises (a) a nucleic acid encoding a protein of 

30 interest; (b) a heterologous promoter operatively linked to the nucleic acid encoding 
the protein; and (c) a transcription temrinator. 

The transcriptional promoter is preferably recognized by an eukaryotic RNA 
polymerase. In a preferred embodiment, the promoter is a "strong" or "efficient" 
promoter. An example of a strong promoter is the immediate early human 

35 cytomegalovirus promoter (Chapman et al, 1991 Nucl Acids ieejl9:3979-3986, which 
is incorporated by reference), preferably without intronic sequences. Most preferred 
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for use within the instant adenoviral vector is a human CMV promoter without 
intronic seqeunces, like intron A. Applicants have found that intron A, a portion of 
the human cytomegalovirus promoter (hCMV), constitutes a region of instability for 
adenoviral vectors. CMV without intron A has been found to effectuate (Examples 1- 
S 3) comparable expression capabilities in vitro when driving HIV gag expression and» 
furthermore, behaved equivalently to intron A-containing constructs in Balb/c mice in 
vivo with respect to their antibody and T-cell responses at both dosages of plasmid 
DNA tested (20 fig and 200 jig). Those skilled in the art will appreciate that any of a 
number of other known promoters, such as the strong immunoglobulin, or other 

10 eukaryotic gene promoters may also be used, including the EFl alpha promoter, the 
murine CMV promoter, Rous sarcoma virus (RS V) promoter, S V40 early/late 
promoters and the beta-actin promoter. 

In preferred embodiments, the promoter may also comprise a regulatable 
sequence such as the Tet operator sequence. This would be extremely useful, for 

15 example, in cases where the gene products are effecting a result other than that 
desired and repression is sought. 

Preferred transcription termination sequences present within the gene 
expression cassette are the bovine growth hormone terminator/polyadenylation signal 
(bGHpA) and the short synthetic polyA signal (SPA) of 50 nucleotides in length, 

20 defined as f oUows: AATAAAAGATCTITATTTTCATTAGATCTGTGTGTTGGT- 
ITilTGTGTG (SEQ ID NO:26). 

The combination of the CMV promoter (devoid of the intron A region) with 
the BGH terminator is particularly preferred although other promoter/terminator 
combinations in the context of FG adenovims may also be used. 

25 Other embodiments incorporate a leader or signal peptide into the transgene. 

A preferred leader is that from the tissue-specific plasminogen activator protein, tPA. 
Examples include but are not limited to the various tPA-gag, tPA-pol and tPA-nef 
adenovirus-based vaccines disclosed throughout this specification. 

In view of the improved adenovirus vectors described herein, an essential 

30 portion of the present invention are adenoviral-based HIV vaccines comprising said 
adenovims backbones which may be administered to a mammalian host, preferably a 
human host, in either a prophylactic or dierapeutic setting. The HTV vaccines of the 
present invention, whether administered alone or in combination regimens with other 
viral- or non-viral-based DNA vaccines, should elicit potent and broad cellular 

35 immune responses against HTV that will either lessen the likelihood of persistent virus 
infection and/or lead to the establishment of a clinically significant lowered virus load 



wo 02/22080 



PCTAJSOl/28861 



subject to HIV infection or in combination with HAART therapy, mitigate the effects 
of previously established HIV infection (antiviral iinmunotherapy(ARI)). While any 
HTV antigen (e.g., gag, pol, nef, gpl60, gp41, gpl20, tat, rev, etc.) may be utilized in 
the h^in described recombinant adenoviral vectors, preferred embodiments include 
5 the codon optimized pS5 gag antigen (herein exemplified as MRKAdSgag), pol and 
nef. Sequences based on different Clades of HTV-1 are suitable for use in the instant 
invention, most preferred of which are Clade B and Qade C. Particularly preferred 
embodiments are those sequences (especiaUy, codon-optimized sequences) based on 
concensus Clade B sequences. Preferred versions of the MElKAdSpol and 

10 MRKAdSnef series of adenoviral vaccines will encode modified versions of pol or 
nef, as discussed herein. Prefmed embodiments of the MRKAd5HIV-l vectors 
carrying HTV envelope genes and modifications thereof comprise the HIV codon- 
optimized env sequences of PCT International Applications PCT/US97/02294 and 
PCTAJS97/10517, published August 28, 1997 (WO 97/31115) and December 24, 

15 1997, respectively; both documents of which are hereby incorporated by reference. 

A most preferred aspect of the instant invention is the disclosed use of the 
adenoviral vector described above to effectuate expression of HIV gag. Sequences 
for many genes of many HIV strains are publicly available in GENB ANK and 
primary, field isolates of HIV are available from the National Institute of Allergy and 

20 Mectious Diseases (NIAID) which has contracted with Quality Biological 

(Gaithersburg, MD) to make these strains available. Strains are also available from 
the World Health Organization (WHO), Geneva Switzerland It is preferred that the 
gag gene be from an HIV-l strain (CAM-1; Myers et al, eds. "Human Retroviruses 
and AIDS: 1995, 1[A3-IIA19, which is hereby incorporated by reference). This gene 

25 closely resembles the consensus amino acid sequence for the clade B (North 
American/European) sequence. Therefore, it is within the purview of the skilled 
artisan to choose an appropriate nucleotide sequence which encodes a specific HIV 
gag antigen, or unmunologically relevant portion thereof. As shown in Example 25, a 
clade B or clade C based p55 gag antigen wiD potentially be useful on a global scale. 

30 As noted herein, the transgene of choice for insertion in to a DNA or MRKAd-based 
adenoviral vector of the present invention is a codon optimized version of p55 gag. . 
Such a MRKAdSgag adenoviral vector is documented in Example 11 and is at least 
referred to herein as MRKAd5HIV-lgag. Of course, additional versions are 
contemplated, including but not limited to modifications such as promoter (e.g., 

35 mCMV for hCMV) and/or pA-terminations signal (SPA for bGH) switching, as well 
as generating MRK Ad5 backbones with or without deletion of the Ad5 E3 gene. 

25 
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The present inventioii also relates a series of MKKAdSpol-based adenoviral 
vaccines which are shown herein to generate cellular immune responses subsequent to 
admimstration in mice and non-human primate studies. Several of the MRKAdSpol 
series are exemplified herein. One such adenoviral vector is referred to as 
5 MRKAdShCMV-inact opt pol(E3+), which comprises the MRKAdS backbone, the 
hCMV promoter (no intron A), an inactivated pol transgene, and contains the Ad5 E3 
gene in the adenoviral backbone. A second exeniplified pre-adenovirus plasmid and 
concomitant virus is referred to as MRKAdShCMV-inact opt pol(E3-). which is 
identical to the former adenoviral vector except that the E3 is deleted. Both 

10 constructions contain a codon optimized, inactivated version of HTV-l Pol, wherein at 
least the entire coding region is disclosed herein as SEQ ID NO;3 and the expressed 
protein is shown as SEQ ID NO:4 (see also Figure 17A-C and Table 1, which show 
targeted deletion for inactivated pol. This and other preferred codon optimized 
versions of HTV Pol as disclosed herein are essentially as described in U.S. 

15 Application Serial No. 09/745,221, filed December 21 , 2000 and PCT International 
Application PCT/USOQ/34724. also filed DecembCT 21, 2000, both documents which 
are hereby incorporated by reference. As disclosed in the above-mentioned 
documents, the open reading frame for these codon-optimized HTV-l Pol-based DNA 
vaccines are represented by codon optimized DNA molecules encoding codon 

20 optimized HTV-l Pol (e.g. SEQ ID NO:2), codon optimized HTV-l Pol fused to an 
amino terminal localized leader sequence (e,g. SEQ ID NO:6), and especially 
preferable, and exemplified by the MRKAdS-Pol construct in e.g.. Example 19, 
biologically inactivated pol ("inact opt Pol"; e.g., SEQ ID N0:4) which is devoid of 
significant PR, RT, RNase or IN activity associated with wild type Pol. In addition, a 
' 25 construct related to SEQ ID NO:4 is contemplated which contains a leader peptide at 
the amino terminal region of the lA Pol protein. A specific construct is hgated within 
an appropriate DNA plasmid vector containing regulatory regions operatively linked 
to the respective HIV-1 Pol coding region, with or without a nucleotide sequence 
encoding a functional leader peptide. To this end, various HIV-1 Pol constructs 

30 disclosed herein relate to open reading firames for cloning to the enhanced first 
generation Ad vectors of the present invention (such a series of MRKAdSpol 
adenoviral vaccine vectors), including but not limited to wild type Pol (comprising the 
DNA molecule encoding WT opt Pol, as set forth in SEQ ID NO:2), tPA-opt WTPol, 
(comprising the DNA molecule encoding tPA Pol, as set forth in SEQ ID N0:6), inact 

35 opt Pol (comprising the DNA molecule encoding lA Pol, as set forth in SEQ ID 

N0:4), and tPA-inact opt Pol, (comprising the DNA molecule encoding tPA-inact opt 

26 
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Pol, as set forth in SEQ ID NO: 8). The pol-based versions of enhanced first 
generation adenovirus vaccines elicit CTL and Th cellular immune responses upon 
administration to the host, including primates and especially humans- As noted in the 
above, an effect of the cellular immune-directed vaccines of the present invention 
5 should be a lower transmission rate to previously uninfected individuals and/or 

reduction in the levels of the viral loads within an infected individual, so as to prolong 
the asymptomatic phase of mV-l infection. 

The present invention further relates to a series of MRKAdSnef-based 
adenoviral vaccines which, similar to HIV gag and pol antigens, generate cellular 

10 immune responses subsequent to administration in mice and non-human primate 
studies. The MRKAdSnef series are exemplified herein by utilizing the improved 
MRK adenoviral backbone in combination with modified versions of HIV nef . These 
exemplified MRKAdSnef vectors are as follows: (1) MRKAdShCMV- 
nef (G2A,LLAA) (E3+), which comprises the improved MRKAdS backbone, a human 

15 CMV promoter an intact Ad5 E3 gene and a modified nef gene: (2) MRKAdSmCMV- 
nef(G2A.LLAA) ^3+), which is the same as (1) above but substituting a murine 
CMV promoter for a human CMV promoter, and (3) MElKAd5mClvIV-tpanef(LLAA) 
(E3+), which is the same as (2) except that the nef transgene is tpanef(LLAA). Codon 
optimized versions of HIV-1 Nef and HIV-1 Nef modifications are essentially as 

20 described in U.S. AppUcation Serial No. 09/738,782, filed December 15, 2000 and 
PCT Ihtemational Application PCT/USOO/34162, also filed December 15, 2000, both 
documents which are hereby incorporated by reference. Particular embodiments of 
codon optimized Nef and Nef modifications relate to a DNA molecule encoding fflV- 
1 Nef from the HIV-1 jfrl isolate wherein the codons are optimized for expression in a 

25 mammalian system such as a human. The DNA molecule which encodes this protein 
is disclosed herein as SEQ ID NO:9, while the expressed open reading frame is 
disclosed. herein as SEQ ID NO: 10. Another embodiment of Nef-based coding 
regions for use in the adenoviral vectors of the present invention comprise a codon 
optimized DNA molecule encoding a protein containing the human plasminogen 

30 activator (tpa) leader peptide fused with the NH2-terminus of the HTV-l Nef 

polypeptide. The DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO: 1 1 , while the expressed open reading frame is disclosed herein as SEQ 
ID NO: 12. Another modified Nef optimized coding region relates to a DNA molecule 
encoding optimized HIV-1 Nef wherein the open reading frame codes for 

35 modifications at the amino terminal myristylation site (Gly-2 to Ala-2) and 

substitution of the Leu-174-Leu-175 dileucine motif to Ala-174-Ala-175, herein 

27 
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described as opt nef (G2A, LLAA). The DNA molecule which encodes this protein is 
disclosed herein as SEQ ID NO:13» while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 14. MRKAdSnef vectors (1) MRKAdShCMV- 
nef (G2A JIAA) (E3+) and (2) MRKAd5mCMV-nef (G2A JiAA) CE3+) contain this 
5 transgene. An additional enilx)diment relates to a DNA niolec 

mV-l Nef wherein the amino terminal myristylation site and dileucine motif have 
been deleted, as well as comprising a tPA leader peptide. This DNA molecule, opt 
tpanef (LLAA), comprises an open reading frame which encodes a Nef protein 
containing a tPA leader sequence fused to amino acid residue 6-216 of HIV-1 Nef 

10 (jfrO* wherein Leu-174 and Leu-175 are substituted with Ala-174 and Ala-175, herein 
referred to as opt tpanef (LLAA) is disclosed herein as SEQ ID NO: 15, while the 
expressed open reading frame is disclosed herein as SEQ ID NO: 16. The 
MRKAdSnef vector 'MRKAdSmCMV-tpanefOLLAA) (E3+)" contains this transgene. 
Along with the improved MRKAdSgag adenovirus vaccine vector described 

15 herein, generation of a MRKAdSpol and MRKAdSnef adenovirus vector provide for 
enhanced HIV vaccine capabilities. Namely, the generation of this trio of adenoviral 
vaccine vectors^ all shown to generate effective cellular immune responses subsequent 
to host administration, provide for the ability to administer these vaccine candidates 
not only alone, but preferably as part of a divalent (i.e., gag and nef, gag and pol, or 

20 pol and nef components) or a trivalent vaccine (i.e., gag, pol and nef components). 
Therefore, a preferred aspect of the present invention are vaccine formulations and 
associated methods of administration and concomitant generation of host cellular 
immune responses associated v^th formulating three separate series of MRKAdS- 
based adenoviral vector vaccines. Of course, this MRKAdS vaccine series based on 

25 distinct HIV antigens promotes expanded opportunities for formulation of a divalent 
or trivalent vaccine, or possibly administration of separate formulations of one or 
more monovalent or divalent formulations withm a reasonable window of time. It is 
also within the scope of the present invention to embaric on combined modality 
regimes which include multiple but distinct components from a specific antigen. An 

30 example, but certainly not a limitation, would be separate MRKAdSpol vectors, with 
one vaccine vector expressing wild type Pol (SEQ ID NO:2) and another 
MRKAdSpol vector expressing inactivated Pol (SEQ ID NO:6). Another example 
might be separate MRKAdSnef vectors, with one vaccine vector expressing the 
tPA/LLAA version of Nef (SEQ ID NO: 16) and another MRKAdSnef vector 

35 expressing the G2AJJLAA modified version of Nef (SEQ ID NO: 14). Therefore, the 
MRKAdS adenoviral vectors of the present invention may be used in combination 

28 . 
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with multiple, distinct HIV antigen classes. Each HIV antigen class is subject to 
sequence manipulation, thus providing for a multitude of potential vaccine 
combinations; and such combinations are within the scope of the present invention. 
The utilization of such combined modalities vaccine formulation and administration 
5 increase the probability of eliciting an even more potent ceUular immune response 
when compared to inoculation with a single modality regimeh. 

The preset invention also relates to application of a mono-, dual-, or tri- 
modality administration regime of the MElKAdSgag, pol and nef adenoviral vaccine 
series in a prime/boost vaccination schedule. This prime/boost schedule may include 

10 any reasonable combination of the MRKAdSgag, pol and nef adenoviral vaccine 

series disclosed herein. In addition, a prime/boost regime may also involve other viral 
and/or non- viral DNA vaccines. A preferable addition to an adenoviral vaccine 
vector regime includes but is not limited to plasmid DNA vaccines, especially DNA 
plasmid vaccines that contain at least one of the codon optimized gag, pol and nef 

15 constructions, as disclosed herein. 

Therefore, one aspect of this invention is the administration of the adenoviral 
vector containing the optimized gag gene in a prime/boost regiment in conjunction 
with a plasmid DNA encoding gag. To distinguish this plasmid from the adenoviral- 
containing shuttle plasmids used in the construction of an adenovirus vector, this 

20 plasmid will be referred to as a "vaccine plasmid" or "DNA plasmid vaccine". 
Preferred vaccine plasmids for use in this administration protocol are disclosed in 
pending U.S. patent application 09/017.981, filed Febmary 3, 1998 and WO98/34640, 
published August 13, 1998, both of which are hereby incorporated by reference. 
Briefly, the preferred vaccine plasmid is designated VI Jns-FLgag, which expresses 

25 the same codon-optimized gag gene as the adenoviral vectors of this invention (see 
Figure 2 for the nucleotide sequence of the exemplified optimized codon version of 
full length p55 gag). The vaccine plasmid backbone, designated VI Jns contains the 
CMV immediate-early (IE) promoter and intron A, a bovine growth hormone-derived 
polyadenylation and transcription termination sequence as the gene expression 

30 regulatory elements, and a minimal pUC backbone; see Montgom^ et aL, 1993, 
DNA Cell Biol 12:777-783. The pUC sequence pemiits high levels of plasmid 
production in E. coli and has a neomycin resistance gene in place of an ampiciUin 
resistance gene to provide selected growth in the presence of kanamycin. 
Alternatively, a vaccine plasmid which has the CMV promote deleted of intron A can 

35 be used. Those of skill in the art will recognize diat alternative vaccine plasmid 
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vectors may be easily substituted for these specific constructs, and this invention 
specifically envisions use of such alternative plasmid DNA vaccine vectors. 

Another aspect of the preset invention is a prime/boost regimen which 
includes a vaccine plasmid which encodes an HIV pol anti^n, preferably a codon 
5 optimized form of pol and also preferably a vaccine plasmid which comprises a - - 
nucleotide sequence which encodes a Pol antigen selected £rom the group of Pol 
antigens as shown in SEQ ID NOs: 2, 4, 6 and 8. The variety of potential DNA 
plasmid vaccines which encode various biologically active forms of HIV-1 Pol, 
wherein administration, intracellular delivery and expression of the HIV-l Pol gene of 

10 interest elicits a host CTL and Th resporise. The preferred syntiietic DNA molecules 
of the present invention encode codon optimized wild type Pol (without Pro activity) 
and various codon optimized inactivated HIV-1 Pol proteins. The HTV-l pol open 
reading disclosed herein are especially preferred for pharmaceutical uses, especially 
for human administration as delivered via a recombinant adenoviral vaccine, 

15 especially an enhanced first generation recombinant adenoviral vaccine as described 
herein. Several embodiments of this portion of the invention are provided in detail 
below, namely DNA molecules which comprise a HIV-1 pol open reading iframe, 
whether encoding full length pol or a modification or fusion as described herein, 
wherein the codon usage has been optimized for expression in a mammal, especially a 

20 human. Again, these DNA sequences are positioned appropriately within a 

recombinant adenoviral vector, such as the exemplified recombinant adenoviral vector 
described herein, so as to promote expression of the respective HIV-l Pol gene of 
interest, and subsequent to administration, elicit a host CTL and Th response. Again, 
these preferred, but in no way limiting, pol genes are as disclosed herein and 

25 essentially as described in U.S. Application Serial No. 09/745,221, filed December 
21, 2000 and PCT International Apphcation PCT/USOO/34724, also filed December 
21, 2000, both documents which are hereby incorporated by reference. 

A third series of vaccine plasmids which are useful in a combined modality 
and/or piime/boost regimen are vaccine plasmids which encode an HIV nef antigen or 

30 biologically and/or inmiunologically relevant modification thereof. As noted 

elsewhere, preferred vaccine plasmids contain a codon optimized form of nef and also 
preferably comprise a nucleotide sequence which encodes a Nef antigen selected from 
the group of Nef antigens as shown in SEQ ID NOs: 10, 12, 14 and 16. These 
preferred nef coding regions are disclosed herein, as well as being described in U.S. 

35 Application Serial No. 09n38,782, filed December 15, 2000 and PCT International 
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Application PCT/tJSOO/34162, also filed December 15, 2000. both documents which 
are hereby incorporated by reference. 

Therefore, the adenoviral vaccines and plasmid DNA vaccines of this 
invention may be administered alone, or may be part of a prime and boost 
5 administration regimen. A mixed modality priming and booster inoculation scheme 
will result in an enhanced immune response, particularly is pre-existing anti-vector 
immune responses are present. This one aspect of this invention is a method of 
priming a subject with the plasmid vaccine by administering the plasmid vaccine at 
least one time, allowing a predetermined length of time to pass, and then boosting by 

10 administ^ng the adenoviral vaccine. Multiple primings typically, 1-4, are usually 
employed, although more may be used. The lengtti of time between prinaing and 
boost may typically vary from about four months to a year, but other time frames may 
be used. In experiments with Aesus monkeys, the animals were primed four times 
with plasmid vaccines, then were boosted 4 months later with the adenoviral vaccine. 

15 Their cellular immune response was notably higher than that of animals which had 
only received adenoviral vaccine. The use of a priming regimen may be particularly 
piefenred in situations where a person has a pre-existing anti-adenovirus immune 
response. 

Furthermore and in the altanative, multiple HIV-1 viral antigens, such as the 

20 MRKAdS adenoviral vaccines disclosed herein, may be ligated into a proper shuttle 
plasmid for generation of a pre-adenoviral plasmid comprising multiple open reading 
frBxncs. For example a trivalent vector may comprise a gag-pol-nef fusion, in either a 
E3(-) or E3(+) background, preferably a E3 deleted backbone, or possible a "2+1" 
divalent vaccine, such as a gag-pol fusion (i.e., codon optimized p55 gag and 

25 inactivated optimized pol; Example 29 and Table 25) within the same MRKAd5 
backbone, wdth each open reading frame being operatively linked to a distinct 
promoter and transcription termination sequence. Alternatively, the two open reading 
frames may be operatively linked to a single promoter, with the open reading frames 
operatively linked by an internal ribosome entry sequence ORES), as disclosed in 

30 International Publication No. WO 95/24485, which is hereby incorporated by 
reference. Figure 9 shows that the use of multiple promoters and termination 
sequences provide for similar growth properties, while Figure 28 shows that these 
MRKAd5gag-based vectors are also stable at least through passage 21. In the absence 
of the use of IRES-based technology, it is preferred that a distinct promoter be used to 

35 support each respective open reading frame, so as to best preserve vector stability. As 
examples, and certainly not as limitations, potential multiple transgene vaccines may 
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include a three transgene vector such as hCMV-gagpol-bGHpA + mCMV-nef-SPA in 
an E3 deleted backbone or hCMV-gagpol-bGi^A + mCMV-nef-SPACE3+). 
Potential "2+1" divalent vaccines of the present invention naigjit be a hCMV-gag- 
bGHpA + mCMV-nef-SPA in an E3+ backbone (vector #1) in combination with 
5 hCMV-pol-bGHpA in an E3+ backbone (vector #2), with all transgenes in the El 
parallel orientation. Fusion constructs other than the gag-pol fusion described above 
are also suitable for use in various divalent vaccine strategies and can be composed of 
any two HTV antigens fused to one another (e.g.,, nef-pol and gag-nef). These 
adenoviral compositions are, as above, preferably delivered along with an adenoviral 

10 composition comprising an additional HIV antigen in order to diversify the immune 
response generated upon administration. Tlierefore, a multivalent vaccine delivered 
in a single, or possible second, adenoviral vector is certainly contemplated as part of 
the present invention. Again, this mode of administration is another example of 
whereby an efQcaceous adenovirus-based HIV-1 vaccine may be administered via a 

15 combined modality regime. It is important to note, however, that in terms of deciding 
on an insert for the disclosed adenoviral vectors, due consid^tion must be dedicated 
to the effective packaging limitations of the adenovirus vehicle. Adenovirus has been 
shown to exhibit an upper cloning capacity limit of approximately 105% of the 
wildtype Ad5 sequence. 

20 Regardless of the gene chosen for expression, it is preferred that the sequence 

be "optimized" for expression in a human cellular environment. A "triplet" codon of 
four possible nucleotide bases can exist in 64 variant forms. That these forms provide 
the message for only 20 different amino acids (as well as transcription initiation and 
termination) means that some amino acids can be coded for by more than one codon. 

25 Indeed, some amino acids have as many as six "redundant", alternative codons while 

m 

some others have a single, required codon. For reasons not completely understood, 
alternative codons are not at all uniformly present in the endogenous DNA of 
differing types of cells and there appears to exist variable natural hierarchy or 
"preference" for certain codons in certain types of cells. As one example, the amino 

30 acid leucine is specified by any of six DNA codons including CTA, CTC, CTG, CIT, 
TTA, and TTG (which correspond, respectively, to the mRNA codons, CUA, CUC, 
CUG, CUU, UUA and UUG). Exhaustive analysis of genome codon frequencies for 
microorganisms has revealed endogenous DNA of E. coli most commonly contains 
the CTG leucine-specifying codon, while the DNA of yeasts and slime molds most 

35 conraionly includes a TTA leucine-specifying codon. In view of this hierarchy, it is 
generally held that the likelihood of obtaining high levels of expression of a leucine- 

32 
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rich polypeptide by an £1 coli host will depend to some extent on the frequency of 
codon use. For example, a gene rich in TTA codons will in all probability be poorly 
expressed in E. coli, whereas a CTG rich gene will probably highly express the 
polypeptide. Similarly, when yeast cells are the projected transformation host cells 
5 for expression of a leucine-rich polypeptide, a preferred codon for use in an inserted 

DNA would be TTA. 

The implications of codon preference phenomena on recombinant DNA 
techniques are manifest, and the phenomenon may serve to explain many prior 
failures to achieve high expression levels of exogenous genes in successfully 

10 transformed host organisms-a less "preferred" codon may be repeatedly present in the 
inserted gene and the host cell machinery for expression may not operate as 
efficiently. Ttds phenomenon suggests that synthetic genes which have been designed 
to include a projected host cellls preferred codons provide a preferred form of foreign 
genetic material for practice of recombinant DNA techniques. Thus, one aspect of 

15 this invention is an adenovirus vector or adenovirus vector in some combination with 
a vaccine plasmid where both specifically include a gene which is codon optimized 
for expression in a human cellular environment As noted herein, a preferred gene 
for use in the instant invention is a codon-optimized HIV gene and, particularly, HIV 
gag, pol or nef 

20 Adenoviral vectors in accordance with the instant invention can be constructed 

using known techniques, such as those reviewed in Hitt et al, 1997 "Human 
Adenovirus Vectors for Gene Transfer into Mammalian Cells" Advances in 
Pliarmacology 40:137-206, which is hereby incorporated by reference. 

In constructing the adenoviral vectors of this invention, it is often convenient 

25 to insert them into a plasmid or shuttle vector. These techniques are known and 
described in Hitt et al.. supra. This mvention specifically includes both the 
adenovims and the adenovirus when inserted into a shuttle plasmid. 

Preferred shuttle vectors contain an adenoviral portion and a plasmid portion. 
The adenoviral portion is essentially the same as the adenovirus vector discussed 

30 supra, containing adenoviral sequences (with non-functional or deleted El and E3 
regions) and the gene expression cassette, flanked by convenient restriction sites. The 
plasmid portion of the shuttle vector often contains an antibiotic resistance marker 
under transcriptional control of a prokaryotic promoter so that expression of the 
antibiotic does not occur in eukaryotic cells. Ampicillin resistance genes, neomycin 

35 resistance genes and other pharmaceutically. acceptable antibiotic resistance markers 
may be used. To aid in the high level production of the polynucleotide by 
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fomentation in prokaryotic organisms, it is advantageous for the shuttle vector to 
contain a prokaryotic origin of replication and be of high copy number. A number of 
commercially available prokaryotic cloning vectors provide these benefits. It is 
desirable to remove non-essential DNA sequences. It is also desirable that the vectors 
5 not be able to replicate in eukaryotic cells. Tliis minimizes the risk of integration of 
polynucleotide vaccine sequences into the recipients* genome. Tissue-specific 
promoters or enhancers may be used whenever it is desirable to limit expression of the . 
polynucleotide to a particular tissue type. 

In one embodiment of this invention, the pre-plasmids (e.g., pMRKAdSpol, 

10 pMRKAdSnef and pMRKAdSgag were generated by homologous recombination 
using the MRKHVE3 (and MRKHVO for the E3- version) backbones and tiie 
appropriate shuttle vector, as shown for pMRKAdSpol in Figure 22 and for 
pMRKAdSnef in Figure 23. The plasmid in linear form is csqpable of replication after 
entering the PER.C6® cells and virus is produced. The infected cells and media were 

15 harvested after viral replication was complete. 

Viral vectors can be propagated in various El complementing cell lines, 
including the known cell lines 293 and PER.C6®. Both these cell lines express the 
adenoviral El gene product. PER.C6® is described in WO 97/00326 (published 
January 3, 1997) and issued U.S. Patent No. 6,033,908, botii of which are hereby 

20 incorporated by reference. It is a primary human retinoblast cell line transduced with 
an El gene segment that complements the production of replication deficient (FG) 
adenovirus, but is designed to prevent generation of replication competent adenovirus 
by homologous recombination. Cells of particular interest have been stably 
transformed with a transgene that encodes the ADSEl A and ElB gene, like PER.C6®, 

25 firom 459 bp to 3510 bp inclusive. 293 cells are described in Graham et al., 1977 7. 
Gen. Virol 36:59-72, which is hereby incorporated by reference. As stated above, 
consideration must be given to the adenoviral sequences present in the 
complementing cell line used. It is important that the sequences not overlap with that 
present in the vector if the possibility of recombination is to be minimized. 

30 It has been found that vectors generated in accordance with the above 

description are more effective in inducing an immune response and, thus, constitute 
very promising vaccine candidates. More particularly, it has been found that first 
generation adenoviral vectors in accordance with the above description carrying a 
codon-optimized HIV gag gene, regulated with a strong heterologous promoter can be 

35 used as human anti-HTV vaccines, and are capable of inducing immune responses. 
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Standard techniques of molecular biology for preparing and puri^dng DNA 
constructs enable the preparation of the DNA immunogens of this invention. 

A vaccine composition comprising an adenoviral vector in accordance with 
the instant invention may contain physiologically acceptable components, such as 
5 buffer, normal saline or phosphate buffered saline, sucrose, other salts and 

polysorbate. One preferred formulation has: 2.5-10 mM TRIS buffer, preferably 
about 5 mM TRIS buffer, 25-100 mM NaCl, preferably about 75 mM NaQ; 2.5-10% 
sucrose, preferably about 5% sucrose; 0.01 -2 mM MgCh; and 0.001%-0.01% 
polysorbate 80 (plant derived). The pH should range from about 7.0-9.0, preferably 

10 about 8.0. One skilled in the ait will appreciate that other conventional vaccine 
excipients may also be used it make the formulation. The preferred formulation 
contains 5mM TRIS, 75 mM NaQ, 5% sucrose, ImM MgCh, 0.005% polysorbate 80 
at pH 8.0 This has a pH and divalent cation composition which is near the optimum 
for Ad5 stability and nnnimizes the potential for adsorption of virus to a glass surface. 

15 It does not cause tissue irritation upon intramuscular injection. It is preferably frozen 
imtiluse. 

The amount of adenoviral particles in the vaccine composition to be 
introduced into a vaccine recipient will depend on the strength of the transcriptional 
and translational promoters used and on the immunogenicity of the expressed gene 

20 product. In general, an immunologically or prophylactically effective dose of 1x10^ 
to 1x10^^ particles and preferably about 1x10^^ to 1x10^' particles is administered 
directly into muscle tissue. Subcutaneous injection, intradermal introduction, 
impression through the skin, and other modes of administration such as 
intraperitoneal, intravenous, or inhalation delivery are also contemplated. It is also 

25 contemplated that booster vaccinations are to be provided. Following vaccination 
with HIV adenoviral vector, boosting with a subsequent HIV adenoviral vector and/or 
plasmid may be desirable. Parenteral administration, such as intravenous, 
intramuscular, subcutaneous or other means of administration of interleukin-12 
protein, concurrently with or subsequent to parenteral introduction of the vaccine 

30 compositions of this invention is also advantageous. 

The adenoviral vector and/or vaccine plasmids of this invention 
polynucleotide may be unassociated with any proteins, adjuvants or other agents 
which impact on the recipients' inomune system. In this case, it is desirable for the 
vector to be in a physiologically acceptable solution, such as, but not limited to, sterile 

35 saline or sterile buffered saline. Alternatively, the vector may be associated with an 
adjuvant known in the art to boost immune responses (i.e., a "biologically effective" 



35 



wo 02/220g0 



PCT/USOl/28861 



adjuvant), sach as a protein or other earner. Vaccine plasmids of this invention may, 
for instance, be delivered in saline (e.g., PBS) with or without an adjuvant Preferred 
adjuvants are Alum or CRL1005 Block Copolymer. Agents which assist in the 
cellular uptake of DNA, such as, but not limited to, calcium ions, may also be used to 
5 advantage. These agents are generally referred to herein as transfection facilitating 
reagents and phaimaceutically acceptable carriers. Techniques for coating 
imcroprojectiles coated with polynucleotide are known in the art and axe also useful in 
connection with this invention. 

This invention also includes a prime and boost regimen wherein a first 

10 adenoviral vector is administered, then a booster dose is given. The booster dose miay 
be repeated at selected time intervals. Alternatively, a preferred inoculation scheme 
comprises priming with a first adenovirus serotype and then boosting with a second 
adenovirus serotype. More preferably, the inoculation scheme comprises priming 
with a first adenovirus serotype and then boosting with a second adenovirus serotype, 

15 wherein the first and second adenovims serotypes are classified within separate 

subgroups of adenoviruses. The above prime/boost schemes are particularly preferred 
in those situations where a preexisting inmunity is identified to the adenoviral vector 
of choice. In this type of scheme, the individual or population of individuals is 
primed with an adenovirus of a serotype other than that to which the preexisting 

20 immunity is identified This enables the first adenovirus to effectuate sufficient 

expression of the transgene while evading existing immunity to the second adenovims 
(the boosting adenovirus) and, fiirther, allows for the subsequent delivery of the 
transgene via the boosting adenovirus to be more effective. Adenovirus serotype 5 is 
one example of a virus to which such a scheme might be desirable. In accordance 

25 with this invention, therefore, one might decide to prime with a non-group C 

adenovirus (e.g., Adl2, a group A adenovirus, Ad24, a group D adenovirus, or Ad35, 
a group B adenovirus) to evade anti-Ad5 immunity and then boost with Ad5, a group 
C adenovirus. Another preferred embodiment involves administration of a different 
adenovirus (including non-human adenovims) vaccine followed by administration of 

30 the adenoviral vaccines disclosed. In the alternative, a viral antigen of interest can be 
first delivered via a viral vaccine other than an adenovirus-based vaccine, and then 
followed with the adenoviral vaccine disclosed. Alternative viral vaccines include but 
are not limited to pox virus and Venezuelan equine encephilitis virus. 

A large body of human and animal data supports the importance of cellular 

35 immune responses, especially CTL in controlling (or eliminating) HTV infection. In 
humans, very high levels of CTL develop following primary infection and correlate 
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10 



with the control of viiemia. Several small groups of individuals have been described 
who are repeatedly exposed to HTV by remain uninfected; CTL has been noted in 
several of these cohorts. In the SIV model of HIV infection, CTL similarly develops 
following pdmary infection, and it has been demonstrated that addition of anti-CD8 
monoclonal antibody abrogated this control of infection and leads to disease 
progression. This invention uses adenoviral vaccines alone or in combination with 
plasmid vaccines to induce CTL. 

The following non-limiting Examples are presented to better illustrate the 

invention. 



EXAMPLE 1 

Removal of the Intron A Portion of the hCMV Promoter 
GMP grade pVUnsHIVgag was used as the starting material to amplify the 
hCMV promoter. PVUnsHIVgag is a plasmid comprising the CMV immediate-early 

15 (E) promoter and intron A, a fuU-length codon-optimized HTV gag gene, a bovine 
growth hormone-derived polyadenylation and transcriptional termination sequence, 
and a minimal pUC backbone; see Montgomery et oL , supra for a description of the 
plasmid backbone. The amplification was performed with primers suitably positioned 
to flank the hCMV promoter. A 5' primer was placed upstream of the Mscl site of 

20 the hCMV promoter and a 3 V primer (designed to contain the Bgl£L recognition 

sequence) was placed 3 ' of the hCMV promoter. The resulting PGR product (using 
high fidelity Tag polymerase) which encompassed the entire hCMV promoter (minus 
intron A) was cloned into TOPO PGR blunt vector and then removed by double 
digestion with Mscl and BgZIL This firagment was then cloned back into the original 

25 GMP grade pVlJnsHTVgag plasmid from which the original promoter, intron A, and 
the gag gene were removed following Mscl and BglE digestion. This ligation 
reaction resulted in the construction of a hCMV promoter (minus intron A) + bGHpA 
expression cassette within the original pVUnsHIVgag vector backbone. This vector 
is designated pVIJnsCMV(no intron). 

30 The ELgag gene was excised from pVUnsHIVgag using BglH digestion and 

the 1,526 bp gene was gel purified and cloned into pVUnsCMV (no intron) at the 
BglR site. Colonies were screened using Smal restriction enzymes to identify clones 
that carried the Flgag gene in the correct orientation. This plasmid, designated 
pVl JnsCMV(no intron)-FLgag-bGHpA. was fully sequenced to confirm sequence 

35 integrity. 
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Two additional transgenes were also constructed. The plasmid, 
pVlJnsCMV(no mtron)-FLgag-SPA, is identical to pVlJnsCMV(no intron)-FLgag- 
bGHpA except that the bovine growth hormone polyadenylation signal has been 
replaced with a short synthetic polyA signal (SPA) of 50 nucleotides in length. The 
5 sequence of the SPA is as shown, with the essential components (poly(A) site, (GT)n, 
and (T)n; respectively) underlined: 

AATAAAA GATCmTATTITCATTAGATC TGTGTG TTGGTTTTTTGTGTG 
{SEQIDNO:18). 

The plasmid, p VlJns-mCMV-FLgag-bGHpA, is identical to the 
10 pVlJnsCMV(no intron)-ELgag-bGHpA except that the hCMV promoter has been 
removed and replaced with the murine CMV (mCMV) promoter. 

Figure 3 diagrammatically shows the new transgene constructs in comparison 
with the original transgene. 

15 EXAMPLE 2 

Gag Expression Assay for Modified Gag Transgenes 
Gag Elisa was performed on culture supematants obtained from transient 
tissue culture transfection experiments in which the two new hCMV-containing 
plasmid constructs, pVl JnsC!MV(no intron)-ELgag-bGI^A and pVlJnsCMV(no 
20 intron)-FLgag-SPA, both devoid of intron A, were compared to pVl JnsHTVgag 

which, as noted above possesses the intron A as part of the hCMV promoter. Table 2 
below shows the in vitro gag expression data of the new gag plasmids compared with 
the GMP grade original plasmid. The results displayed in Table 2 show that both of 

■ 

the new hCMV gag plasmid constructs have expression capacities comparable to the 
25 original plasmid construct which contains Ae intron A portion of the hCMV 
promoter. 
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Table 2: In vitro DNA transfection of original and new plasmid HTV-l gag constructs. 



Plasmid 


MR gag/10e6 COS cells/5fig DNA/48 hr 


HIVFL^gagPR9901" 


10.8 


PVUns-hCMV-FLgag-bGHpA'* 


16.6 


pVlJns-hCMV-ELgag-SPA'^ 


12.0 



* GMP grade pVl Jns-hCMVintronA-ELgag-bGHpA. 
5 ^ New plasmid constructions that have the intron A portion removed from the hCMV 
promoter. 

^ In this construct the bGH terminator has been replaced with the short synthetic 
polyadenylation signal (SPA) 



10 EXAMPLES 

Rodent (Balh/c) Study for Modified gag Transgenes 
A rodent study was performed on the two new plasmid constructs 
described above - pVl JnsCMV(no intron)-ELgag-bGHpA and pVl JnsCMV (no 
intron)-FLgag-SPA - in order to compare them with the construct described above 
15 possessing the intron A portion of the CMV promoter, pVl JnsHTV gag. Gag antibody 
and Elispot responses (described in PCX International Application No. 
PCT/USOO/18332 (WO 01/02607) filed July 3, 2000, claiming priority to U.S. 
Provisional Application Serial No. 60/142,631, filed July 6. 1999 and U.S. 
Application Serial No. 60/148,981, iBIed August 13, 1999, all three applications which 
20 are hereby incorporated by referaice) were measured. The results displayed in Table 
3 below, show that the new plasmid constructs behaved equivalently to the original 
. construct in Balb/c mice with respect to their antibody and T-cell responses at both 
dosages of plasmid DNA tested, 20 [Xg and 200 jig. 
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EXAMPUE4 

Table 3: H1V1 91 : Immunogenicity of VUns-gag under different promoter and termfnatfon control 
elements. 



DNA" 

Promoter/terminator 


Dose, 
ug** 


Ant}-p24 Titers 
(3 Wk PDIf 


SFC/IO'^ Cells 
(4WkPD1)'' 


GMT 


+SE 


-SE 


Media 


gag197-205 


P24 


HIVFL-gagPRDDOl 


200 


12800 


4852 


3412 


2(2) 


129(19) 


30(11) 


(GIMP grade) 


20 


5572 


1574 


1227 


0 


56(9) 


25(6) 


pVIJns-hCMV- 


200 


11143 


2831 


2257 


0 


98(5) 


12(6) 


FL-gag-bGHpA 


20 


7352 


2808 


2032 


0 


73(9) 


11(6) 


pVUns-hCMV- 


200 


16890 


5815 


4326 


1(1) 


94(4) 


26(7) 


FL-gag-SPA 


20 


5971 


5361 


2825 


0 


85(17) 


38(10) 


Naive 


0 


123 


50 


36 


0 


0 


0 



"in PBS 



^.m. Injecfions into both quads, 50 |iL per quad 
"nsl 0;GMT, geometric mean liter; SE, standard, error 

''n==5, pooled spleens; mean of triplicate wsHs and standard, deviation, in parentheses; 

Construction of the Modified Shuttle Vector -'MRKpdelEl Shuttle" 
The modifications to the original Ad5 shutfle vector (pdelElsplA; a vector 
comprising Ad5 sequences from basepairs 1-341 and 3524-5798, with a multiple 
5 cloning region between nucleotides 341 and 3524 of Ad5, included the following 
three manipulations carried out in sequential cloning steps as follows: 
(1) The left ITR region was extended to include the Pad site at the junction between 
the vector backbone and the adenovirus left ITR sequences. This allow for easier 
manipulations using the bacterial homologous recombination system. 
10 (2) The packaging region was extended to include sequences of the wild-type (WT) 
adenovirus from 342 bp to 450 bp inclusive. 

(3) The area downstream of pIX was extended 13 nucleotides (i.e., nucleotides 3511- 
3523 inclusive). 

These modifications (Figure 4) effectively reduced the size of the El deletion without 
15 overlapping with any part of the El A/EIB gene present in the transformed PER.C6® 
cell line. .All manipulations were performed by modifying the Ad shuttle vector 
pdelElsplA. 

Once the modifications were made to the shuttle vector, the changes were 
incorporated into the original Ad5 adenovector backbones (pAdHVO and pAdHVE3) 
20 by bacterial homologous recombination using E. coli BJ5183 chemically competent 
cells. 
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EXAMPLES 

Construction of Modified Adeno vector Backbones ffi3+ and E3-) 
The original adenovectois pAdHVO (comprising all Ad5 sequences except 
those nucleotides encompassing the El and E3 regions ) and pADHVE3 (comprising 
5 all Ad5 sequences except those nucleotides encompassing the El region), were each 
reconstructed so that they contained the modifications to the El region. TTiis was 
accomplished by digesting die newly modified shuttle vector (MRKpdelEl shuttle) 
with Pacl BiidBstZllOl and isolating the 2,734 bp fi:agraent which corresponds to the 
adenovirus sequence. This fragment was co-transformed with DNA firom either Clal 

10 linearized pAdHVO (E3- adenovector) or Clal linearized pAdHVE3 

(E3+adenovector) into E. coli BJ5183 competent cells. At least two colonies from 
each transformation were selected and grown in Terrific™ btotiht for 6-8 hours until 
turbidity was reached. DNA was extracted from each cell pellet and then transformed 
into E. coli XLl competent cells. One colony firom each transformation was selected 

15 and grown for plasmid DNA purification. The plasmid was analyzed by restriction 
digestions to identify correct clones. Hie modified adenovectors were designated 
MRKpAdHVO 023- plasmid) and MRKpAdHVE3 (E3+ plasmid). Virus firom these 
new adenovectors (MRKHVO and MRKHVE3, respectively) as well as the old 
version of the adenovectors were generated in the PER.C6® cell lines to accommodate 

20 the following series of viral competition experiments. In addition, the multiple 
cloning site of the original shuttle vector contained Clal , BamEI, Xho I, EcoRV, 
Hindm, Sal I, and Bgl n sites. This MCS was replaced with a new MCS containing 
Not I, Cla I, EcoRV and Asc I sites. This new MCS has been transferred to the 
MRKpAdHVO and MRKpAdHVE3 pre-plasmids along with the modification made 

25 to the packaging region and pK gene. 

EXAMPLE6 

Analysis of the Effect of the Packaging Signal Extension 
To study the effects of the modifications made to the El deletion region, the 
30 viruses obtained from the original backbone (pAdHVE3) and the new backbone 
(MRKpAdHVES) were mixed together in equal MOI ratios (1 : 1 and 5:5) and 
passaged through several rounds; see Figure 5, Expt.#l. Both of the viruses in the 
experiment contained the E3 gene intact and did not contain a transgene. The only 
difference between the two viruses was within the region of the El deletion. 
35 Following the coinfection of the viruses at PI (passage 1), the mixtures were 

. propagated through an additional 4 passages at which time the cells were harvested 
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and the virus extracted and purified by CsCl banding. The viral DNA was extracted 
and digested with HindSR and the digestion products were then radioactively labeled. 
For the controls, the respective pre-plasmids (pAdHVES ("OLD E3+"); 
MRKpAdHVES fNEW E3+'0) weie also digested with HindHI (and Pacl to remove 
the vector baclcbdne) and subsequently labeled with VjdATP. Hie radioactively 
labeled digestion products were subjected to gel electrophoresis and the gel was dried 
down onto Whatman pap^ before being exposed to autoradiographic film. Figure 6 
clearly shows that the new adenovirus which has the addition made to the packaging 
signal region has a growth advantage compared with the original adenovirus. In the 
experiments performed (at either ratio tested), only the digestion bands pertaining to 
the newly modified virus were present. The diagnostic band of size 3,206 (firom the 
new virus) was clearly present. However, there was no evidence of the diagnostic 
band of size 2,737 bp expected from the original virus. 

EXAMPUE? 
Analysis of the Effect of the E3 Gene 
The second set of the virus competition study involved mixing equal MOI 
ratio (1:1) of the newly modified viruses, that obtained from MRKpAdHVO and 
MRKpAdHVE3 (Figure 5, Expt. #2). In this set, both viruses had the new 
modifications made to the El deletion. The first virus (that from MRKpAdHVO) 
does not contain an E3 gene. The second virus (that from MRKpAdHVE3) does 
contain the E3 gene. Neither of the viruses contain a transgene. Following co- 
infection of the viruses, the mixtures were propagated through an additional 4 
passages at which time the cells were harvested and the total virus extracted and 
purified by CsCl banding. The viral DNA was extracted and digested with HifuSR 
and the digestion products were then radioactively labeled. For the controls, the 
respective pie-plasmids MRKpAdHVO ("NEW E3-"); MRKpAdHVE3 ("NEW 
E3+") were also digested with Hindni (and Pacl to remove the vector backbone) and 
then labeled with [33p]dATP. The radioactively labeled digestion products were 
subjected to gel electrophoresis and the gel was dried down onto Whatman paper 
before being exposed to autoradiographic film. Figure 6 shows the results of the viral 
DNA analysis of the E3+ virus and E3- virus mixing experiment. The diagnostic 
band corresponding to the E3+ virus (5,665 bp) was present in greater amount 
compared with the diagnostic band of 3,010 bp corresponding to the E3- virus. This 
indicates that the virus that contains the E3 gene is able to amplify more rapidly 
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compared with the virus that does not contain an £3 gene. This increased 
amplification capacity has been confirmed by growth studies; see Table 4 below. 

EXAMPLE8 

Construction of the new shuttle vector cnntaininp r nnrlTfieri pap tranfi gene - 

"MRKpdelEl^CMVfDO intronVFLgag-bGHpA" 
The modified plasmid pVlJnsCMV(no intn>n)-FLgag-bGI^A was digested 
with M^cl overnight and then digested with ,^1 for 2 hours at 50°C. Th& DNA was 
then treated with Mungbean nuclease for 30 nuns at 30°C. The DNA mixture was 
desalted using the Qiaex n kit and then Klenow treated for 30 mins at 37**C to fiilly 
blunt the ends of the transgene fragment The 2,559 bp transgene fragment was then 
gel purified. The modified shuttle vector (MRKpdelEl shuttle) was linearized by 
digestion with EcoRV, treated with calf intestinal phosphatase and the resulting 6,479 
bp fi-agment was then gel purified The two purified fragments were then ligated 
together and several dozen clones were screened to check for insertion of the 
transgene withm the shuttle vector. Diagnostic restriction digestion was performed to 
identify those clones carrying the transgene in the El parallel and El anti-parallel 
orientation. This strategy was followed to clone in the other gag transgenes in the 
MRKpdelEl shuttle vector. 

EXAMPLE9 
Construction of the MRK FG Adenovectors 
The shuttle vector containing the HTV-l gag transgene in the El parallel 
orientation, MRKpdelEl -CMV(no intron>ELgag-bGHpA, was digested with PacL 
The reaction mixture was digested with BsfZllL Hie 5,291 bp fragment was purified 
by gel extraction. The MRKpAdHVE3 plasmid was digested with Clal overnight at 
37^ and gel purified. About 100 ng of the 5,290 bp shuttle +transgene fi-agment and 
-100 ng of linearized MRKpAdHVE3 DNA were co-transformed into E. coli BJ5183 
chemically competent cells. Several clones were selected and grown in 2 ml 
Terrific™ broth for 6-8 hours, until turi>idity was reached. The total DNA from the 
cell pellet was purified using Qiagen alkaline lysis and phenol chloroform method. 
The DNA was precipitated with isopropanol and resuspended in 20 |il dHiO. A 2 fil 
aliquot of this DNA was transformed into E. coli XL-1 competent cells. A single 
colony from each separate transformation was selected and grown overnight in 3 ml 
LB +100 jig/ml ampicillin. The DNA was isolated using Qiagen colmnns. A positive 
clone was identified by digestion with the restriction enzyme BstESl which cleaves 
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within the gag gene as well as the plasmid backbone. The pre-plasmid clone is 
designated MRKpAdHVE3+CMV(no intron)-ELgag-bGHpA and is 37,498 bp in size. 
ITiis strategy was followed to generate E3- and E3+ versions of each of the other gag 
transgene constructions in both El parallel and El anti-parallel versions. Figuies 7A, 
7B and 7C show the various combinations of adenovectors constructed. 

EXAMPLE 10 
Plasmid Competition Studies 

A series of plasmid competition studies was carried out Briefly, the screening 
of the various combinations of new constructs was perfonned by mixing equal 
amounts of each of two competing plasmids. In the experiment shown in Figure 8 A, 
plasmids containing the same transgene but in different orientations were mixed 
together to create a "competition" between the two plasmids. The aim was to look at 
the effects of transgene orientation. In the experiment shown in Figure 8B, plasmids 
containing different polyadenylation signals (but in the same orientation) were mixed 
together in equal amounts. The aim was to assess effects of polyA signals. Following 
the initial transfection, the virus was passaged through ten rounds and the viral DNA 
analyzed by radioactive restriction analysis. 

Analysis of the viral species firom the plasmid mixing experiment (Figure 8A) 
showed that adenovectors which had the transgene inserted in the El parallel 
orientation amplified better and were able to out-compete the adenovirus which had 
the transgene inserted in the El anti-parallel orientation. Viral DNA analysis of the 
mixtures at passage 3 and certainly at passage 6, showed a greater ratio of the virus 
carrying the transgene in the El parallel orientation compared widi the El antiparallel 
version. By passage 10, the only viral species observed was the adenovector with the 
transgene in the El parallel orientation for both transgenes tested (hCMV(no intron)- 
FLgag-bGHpA and hCMV(no intron)-FLgag-SPA). 

Analysis of the viral species from the plasmid mixing experiment #2 (Figure 
8B) at passages 3 and 6 showed that the polyadenylation signals tested (bGHpA and 
SPA) did not have an effect on the growth of the virus. Even at passage 10 the two 
viral species in the mixture were still present in equal amounts. 
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EXAMPLE 11 

Virus generation of an enhanced adenoviral construct - 'MRK AdS HIV-lga|g" 
The results obtained from the competition study aUowed us to make the 
following conclusions: (1) The packaging signal extension is beneficial; (2) Presence 
of E3 does enhance viral growth; (3) El parallel orientation is recommended; and (4) 
PolyA signals have no effect on the growth of the adenovirus. 

MRK AdS EHV-l gag exhibited the most desirable results. This construct 
contains the hCMV(no intron)-FLgag'bGI^A transgene inserted into the new E3+ 
adenovector backbone, MRKpAdHVEB, in the El parallel orientation. We have 
designated this adenovector MRK AdS HIV-1 gag. This construct was pr^ared as 
outlined below: 

The pre-plasmid MIlKpAdtIVE3+CMV(no intron)-FLgag-bGHpA was 
digested was Pacl to release the vector backbone and 3.3 ^g was transfected by 
calcium phosphate method (Amersham Pharmacia Biotech.) in a 6 cm dish containing 
PER.C6® cells at -60% confluence. Once CPE was reached (7-10 days), the culture 
was freeze/diawed three times and the ceU debris pelleted. 1 ml of this cell lysate was 
used to infect into a 6 cm dish containing PER.C6® cells at 80*90% confluence. Once 
CPE was reached, the culture was freeze/thawed three times and the cell debris 
pelleted. The cell lysate was then used to infect a IS cm dish containing PER.C6^ 
cells at 80-90% confluence. This infection procedure was continued and expanded at 
passage 6. The virus was then extracted from the cell pellet by CsCl method. Two 
bandings were performed (3-gradient CsCl foUowed by a continuous CsCl gradient). 
Following the second banding, the virus was dialyzed in A105 buffer. Viral DNA 
was extracted using pronase treatment followed by phenol chloroform. The viral 
DNA was then digested with HindSL and radioactively labeled with [33p]dATP. 
Following gel electrophoresis to separate the digestion products the gel was dried 
down on Whatman paper and then subjected to autoradiography. The digestion 
products were compared with the digestion products from the pre-plasmid (that had 
been digested with PacllHindSL prior to labeling). The expected sizes were 
observed, indicating that the virus had been successfully rescued. This strategy was 
used to rescue virus from each of the various adenovector plasmid constructs 
prepared. 
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EXAMPLE 12 
Stability Analyses 

To determine whether the yarious adenovectox constructs (e.g., MRK Ad5 
HIV-1 gag) show genetic stability, the viruses were each passaged continually. The 
S yiral DNA was analyzed at passages 3, 6 and 10. Each yims maintained its correct^ 
genetic structure. In addition, the stability of the MRK Ad5 HIV-1 gag was analyzed • 
under propagation conditions similar to that performed in large scale production. For 
this analysis, the transfections of MRK AdS HIV-1 gag as well as three other 
adenoviral vectors were repeated and the virus was purified at P3. The three other 

10 adenovectors were as follows: (1) that comprising hCMV(no intron)-Flgag with a 
bGHpA terminator in an E3- adenovector backbone; (2) that comprising hCMV(no 
intron)-Flgag with a SPA termination signal in an E3+ adenovector backbone, and 
that comprising a mCMV-Hgag with a bGHpA terminator in an E3+ adenovector 
backbone. All of the vectors have the transgene inserted in the El parallel orientation. 

15 Viral DNA was analyzed by radioactive restriction analysis to confirm that it was 
correct.before being delivered to fermentation cell culture for continued passaging in 
serum-free media. At P5 each of the four viruses were purified and the viral DNA 
extracted for analysis by the restriction digestion and radiolabeling procedure. This 
virus has subsequently been used in a series of studies (in vitro gag expression in COS 

20 cells, rodent study and rhesus monkey study) as will be described below. The viruses 
from P5 are shown in Figure 9. 

The passaging under serum-free conditions was continued for the 
MRKHVE3 (transgene-less, obtained from MRKpAdHVE3 pre-plasmid) and the 
MRKAdSHIV-lgag (obtained from MRKpAdHVE3+CMV(no intron>FLgag- 

25 bGHpA pre-plasmid) viruses. Figure 10 shows viral DNA analysis by radioactive 
restriction digestion at passage 11 for MRKHVE3, MRKAd5H[V-lgagE3-, and 
passage 1 1 and 12 for MRKAdSHTV-lgag. Aside from the first lane which is the 
DNA marker lane, the next three lanes are virus from the pre-plasmid controls 
(controls based on the original virus) - MRKpAdHVE3 (also referred to as 

30 "pMRKHVE3"), MElKpAdHVE3+CMV(no intron)-FLgag-bGHpA, and 

pMRKAd5gag(E3-), respectively. As seen in Figure 10, each of the viral DNA 
samples show the expected bands with no extraneous bands showing. This signifies 
that there are no major variant adenovirus species present that can be detected by 
autoradiography. 

35 Figure 1 1 shows the results of viral competition study between MRKHVE3 

and MRKAd5HIV-lgag. These viruses were mixed together at equal MOI (140 viral 
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particles each; 280 vp total) at passage 6 and continued to be passaged until Pll. 
Aside from the first lane which is the DNA marker lane, the next two lanes are the 
pre~plasmid controls obtained from MRKpAdHVES and MRKpAdHVE3+CMV(no 
intrDn)-ELgag-bGHpA. The next two lanes are the viral DNA from the startmg viral 
5 material at passage six. The last two lanes are the competition studies performed in 
duplicate. The data in Figure 11 shows the effect the gag transgene in culture. 
Growth of a MRKAd5gag virus was compared with growth of a "transgene-less" 
MRKHVE3. These two vmises were infected at the same MOI (i.e. 140 vp each) at 
passage 6 and then passaged through to passage 1 1 and the viral pool was analyzed by 
10 radioactive restriction analysis. The data shows feat one virus did not out compete the 
other. Therefore, the gag transgene did not show obvious signs of toxicity to the 
adenovirus. 

Analysis by HindDI digestion shows that each virus specie is present in 
approximately equal amounts. As above, there does not appear to be signs of any 
15 extraneous bands. Figure 12 shows higher passage numbers for MRKAdSHTV-lgag 
grown under serum-containing conditions. Hie genome integrity again has been 
maintained and there is no evidence of rearrangements, even at the highest passage 
level (P21). 

Each of the four vectors shown in Figure 9 were analyzed for amplification 
20 capacity. Table 4 below shows the QPA analysis used in the estimation of viral 
amplification ratios at P4. The determination of the amplification ratio for the 
original HIV-1 gag construct is based on the clinical lot at P12. It has been shown 
that amplification rates increases with higher passage number for the original virus. 
The reason for this observation is due to the emergence of variants which exhibit 

« 

25 increased growth rates compared to the intact adenovector. With continued passaging 
of the original Ad gag vector, the level of variants increases and hence amplification 
rates increase also. 

The MRK AdS HIV-1 gag vims has also been continually passaged under 
process conditions (i.e., serum-free media). Viral DNA extracted from passages 1 1 

30 and 12 show no evidence of rearrangement. 
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Table 4: 

Amplification Ratios Based on AEX and QPA Analysis of 
Virus Amplification from Passage 3 to Passage 4. 



Ad gag c(mstnict _ _ 


. ..Amplification Ratio 


MRKAdSeag 


470 


HCMV-Flgag-bGHpA [E3-] 


115 


HCMV-Hgag-SPA [E3+] 


320 


mCMV-FLgag-bGHpA [E3+] 


420 


Original construct * 


40-50 



5 

^ This estimation is based on the clinical lot growth characteristics at Passage 12. 

EXAMPLE 13^ 

10 Analytical Evaluation of the enhanced Ad5 Constructs 

To study the effects of the transgene and the E3 gene on virus amplification, 
the enhanced adenoviral vector, MRK Ad5 HIV-l gag, along with its transgene-less 
version (MRKpAdHVE3) and its E3- version (MRK Ad5 HIV-l gag E30, was 
studied for several passages under serum-free conditions. Table 5A shows the 

15 amplification ratios determined for passages P3 to P8 for MRK Ad5 HTV-l gag. 
Within a certain MOI range, it has been determined that the virus output is directly 
proportional to the virus input. Therefore, the greater the number of virus particles 
per cell at infection, the greater the virus amount produced. Viral amplification ratios, 
on the other hand, are inversely proportional to the virus input The lower the virus 

20 input, the greater the amplification ratio. 

Table 5B shows the amplification rates of the new E3+ vector backbone 
MRKpAdHVES. It has a significantly lower rate of amplification compared with the 
gag transgene containing version. This may be contributed to the larger size MRK 
Ad5 HIV-l gag since it contains the transgene. This inclusion of the transgene brings 

25 the size of the adenovirus closer to the size of a wild type Ad5 vims. It is well known 
that adenoviruses amplify best when they are at close to their wild type genomic size. 
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Wild type Ad5 is 35,935 bp. The MRKpAdHVEB is 32, 905 bp in length. The 
enhanced adenovector MRK Ad5 HTV-l gag is 35,453bp (See Rgure 14 for vector 
map; see also Figure 15A-X show the complete pre-adenoviral vector sequence, 
which includes an additional 2,021 bp of the vector backbone).' 
5 Table 5C shows the amplification rates of the new E3- gag containing virus 

MRK Ad5 mV-l gag E3-. Once again, this virus shows lower growth rate than the 
enhanced adenoviral vector. This noiay be attributed to the decreased sized of this 
virus (due to the E3 gene deletion) compared with wild type Ad5. The MRK Ad5 
HTV-l gag E3- virus is 32,810 bp in length. Tliis can be compared with the wild type 
10 Ad5 which is 35,935 bp and MRK Ad5 HIV-1 gag which is 35,453 bp in length. 
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Table 5A: Amplification ratios determined by AEX and QPA for MRKAdSgag over several 
continuous passaging in semm free media. Following PS. two replicate samples were taken (rep- 
1 and rep-2) and analyzed. 



MRKAdSgag repl 





Xv(10rceQs/n 
infecUon 


Harvest 


HarvBslTims 


CenPassaga 
NuDibBr 


TBer 
lO^vpMcittiin 


Ttor 


OPA 
llf TCRWml 


Ratio 
ABttSPA 


AnnfAScaSon 
RaBD 


AEX ~ 
bitaniBlCcMMI 


P4 




a5a.&Q% 


44 


46 


&7 


&8 


1.72 


SO 


470 
A40U125) 




PS 




aj68.47% 


48 


40 


67 


4.8 


1.88 


40 


170 


pe 


1.M.94% 


0J68»77% 


47 


48 


&B 


&6 


1v42 


41 


200 


P7 


1^. 84% 


ae6,6i% 


49.5 


50 


3.0 


1.4 


097 


40 


SO 


P7 


1.09, 97% 


0.76.59% 


50 


52 


Si 


4.7 


1.70 


31 


170 


pa 


1.03, 94% 


0186.64% 


47.5 


54 


0.0 


a7 


t.lD 


82 


310 


p» 


0>89i 95% 


0199173% 


47.5 


56 


AA 


AS 


1X3 


43 


175 


ai2 

2M 




1.09, 01% 


1^)6166% 


47.£ 


SB 


io 


2A 


1.16 


28 


100 


2.70 

2.60 


pii 


1.19.88% 


0.98.65% 


47 


60 


3.6 


3.0 


1.15 


31 


110 


Z70 
2.70 


P12 


0.98. 91% 


0,85,63% 


47.5 


47 


5.4 


5.5 


1.20 


45 


200 


2.6E 
2.60 


P13 


1.00, 83% 


0.7a 67% 


49 


49 


5.8 


5.8 


1.11 


52 


210 


3.18 
3.16 




1J94.82% 


0JB8.67% 


46 


63 


6£ 


AA 






100 


9L28 
a27 


PtS 


0.97,06% 


a64,6e% 


47 


47 


6.0 


7.1 






250 


3.12 
£01 



Table 5B: Amplification ratios detemnined by AEX and QPA for MRKHVE3 over several 
continuous passaging in serum free media. MRKHVE3 is the new vector backbone which does 
NOT cany a transgene. 

MRKHVE3 





Xv (l(f ceOsAi 
Mecflon 


Harvest 


Harvest Tbne 
h.|U. 


CeD Passage 
Numtsr 


Titer 
lO^'vpftnlctiRurB 


Titer 


QPA 
lO^TCUVml 


Ratio 
AEXX3PA 


AmpSflcsdon 
Raflo 


ADC 
InlBTiui Oonvol 


P4 


1.10.97% 


1.28.78% 


49 


54 


4.1 


33 


1.70 


25 


300 
fMC3let29 




PS 


092.69% 


1.18^ 77% 


47 


46 

• 


43 


4.7 


134 


35 


170 


P6 


1^5. 68% 


1.26,78% 


48.5 


SO 


13 


03 


036 


21 


30 


P6 


1.09.07% 


1.11.81% 


48 


S2 


4.0 


33 


1.16 


34 


ISO 


P7 


1.17.91% 


1.22; 91% 


47.5 


54 


a7 


33 


030 


74 


110 


P8 


0.98.88% 


1.41.83% 


48 


56 


2.1 


21 


047 


45 


75 


3.12 
284 


P8 


120189% 


1.26,81% 


473 


58 


08 


Ow7 


039 


28 


26 


2.70 
2.60 


PtO 


0l99.82% 


1^5.88% 




60 


23 


23 


043 


S3 


80 


2.70 
2.70 


P11 


1.07,08% 


1.281,83% 


48 


47 


2.7 


23 


041 


66 


00 


2.86 

zeo 


P12 


0.60, 01% 


1.14,80% 


493 


49 


S3 


7.4 


046 


123 


2B0 


at8 

3.18 


P13 


1^8.95% 


1.14. 8S% 


453 


S3 


5.8 


33 






110 


338 
337 


PU 


0.97,88% 


1.03,98% 


483 


47 


9.4 


9.7 






SSO 


a.12 

231 


PtS 


067,99% 


0107.59% 


493 


49 


S3 


6.1 






218 


2.78 
2.52 
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Table 5C. AmpRflcation ratios determined by AEX and QPA for MRKAd5gag(E3-) over several 
continuous passaging in seaim free media. This construct is identical to the MRKAdSgag construct 
except that this version is DELETED of the E3 gene. 

5 

MRKAd5gag(E3) 





Xv(itf eeOsAi 
InSscdon 


iQ.VlabIQiy{%) 
Harvest 


HsivcstTbns 
h.|>.L 


CeOPassago 
Number 


ntar 
lO^vpAidaAure 


Titer 
10*i9)tee0 


QPA 


Raflo 
AEX.-QPA 


ArapEficsllon 
Ratto 


AEX 
Inttnia} Control 


P4 


1^77% 


1.12.62% 


473 


46 


2JD 


ije 


032 


20 


100 
(M0M25) 




P5 


1.16.92% 


062. 43% 


49 


49 


33 


23 


039 


34 


100 




P6 


1.71, 6S% 


020. 10% 


49 


90 


4.7 


2.7 


1.70 


28 


100 




P6 


1^97% 


0.63,54% 


49S 


S2 


6.4 


6.0 


1.76 


31 


180 




P7 


1.17,91% 


098.72% 


47^ 


54 


7.1 


6.1 


037 


106 


220 




PB 


098.88% 


077,4B% 


48 


SB 


3.1 


33 


036 


47 


lis 


3.12 
234 


P9 


1^B9% 


1.03.72% 


40 


58 


13 


13 


037 


32 


55 


2.70 
230. 


PtO 


0.99.82% 


0.80,62% 


A6J5 


60 


32 


32 


0^8 


47 


lis 


2-70 
2.70 


P11 


lJD7,g8% 


098170% 


483 


47 


53 


5.5 


068 


67 


200 


236 
230 


P12 


080.91% 


087,9% 


GO 


49 


5.1 


6l4 


072 


71 


230 


3.18 
a.18 


P13 


1.96.95% 


091.59% 


453 


53 


7.4 


&8 






135 


338 

337 


PU 


0l97.95% 


Ofil. 74% 


48 


47 


6.8 


73 






250 


ai2 

231 


PIS 


087,99% 


064,56% 


49 


49 


43 


53 






198 


2.78 
232 



EXAMPLE 14 
Gag Expression Analysis of the Novel Constructs 
In vitro gag analysis of the MRK Ad5 HIV-1 gag and the original HTV-gag 
10 vectors (research and clinical lot) show comparable gag expression. The clinical lot 
shows only a slightly reduced gag expression level. The most noticeable difference is 
with the mCMV vector. This vector shows roughly 3 fold lower expression levels 
compared with the other vectors tested (which all contain hCMV promoters). The 
mCMV-ELgag with bGHpA assay was performed three times using different 
15 propagation and purification lots and it consistently exhibited weaker gag expression. 

EXAMPLE 15 

Evaluation of MRK Ad5 HIV-1 gag and Other gag-Containing 
. - Adenovectors in Balb/c Mice 
20 Cohorts of 10 balb/c mice were vaccinated intramuscularly with escalating 

doses of MRK Ad5 HIV-1 gag, and the research and clinical lots of original 
AdSHTV-lgag. Serum samples were collected 3 weeks post dose 1 and analyzed by 
anti-p24 sandwich ELIS A. 
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Anti-p24 titers in mice that received MRK Ad5 HIV-1 gag (107 and 109 
vp(viral particle) doses) were comparable (Rgure 13) to those of the research lot of 
AdSmV-l gag, for which much of the early rhesus data were generated on. These 
titeas were also comparable when E3 is deleted (MRKAdShCMVgagbGHpA^-)) or 
5 SPA is substituted for bGHpA terminator (MRKAdS hCMV-gag-SPA (E3+)) or 
murine CMV promoter is used in place of hCMV (MRKAdS mCMV-gag-bGHpA 
(E3+)) in the MRKAdS backbone. 

The results shown in Table 7 indicate that the three other vectors ( in addition 
to the preferred vector, MRK AdS HIV-1 gag, are also capable of inducing strong 

10 anti-gag antibody responses in mice. Interestingly enough, while the mCMV-ELgag 
construct containing bOHpA and E3+ in an £1 parallel orientation showed lowest gag 
expression in the COS cell in vitro infection (Table 6) in comparison with the other 
vectors tested, it generated the greatest anti-gag antibody response this in vivo Balb/c 
study. Table 7 also shows a dose response in anti-gag antibody production in both the 

IS research and the clinical lot. As expected, the clinical lot shows reduced anti-gag 
antibody induction at each dosage level compared to the same dosage used for the 
research lot. 

Table 6: In vitro analysis for gag expression in COS cells by Elisa assay. 

20 



Viral Vectors* 


tig gag/4.8xl0e5 CXXS/lOeS paits/48hr 


MRKAdSgag* 


1.40 


Clinical lot Ad5gag° 


1.28 


Research lot AdSgag** 


1.32 


MCMVEL-gagbGHpA' 


0.42 



Azfionm absorbance readings taken for viral particle determinations. 
^ MRKAdSgag was produced in serum free conditions and purified at P5. 
" Clinical lot# AdSgagFNOOOl 
25 ^ Research AdSELgag lot# 6399 

' mCMVEL-gagbGHpA was produced in serum free conditions and purified at P5 . 
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Table 7: mHlV020 Anti-p24 Ab Titers in Balb/c mice (n=10) vaccinated with various Adgag 
constnicts and lots (3 weeic post dosel ). 



Group 
ID 


Vaccine 


Dose 


GMT 


SE upper 


SE lower 


1 
2 


'MRKAdSgag 

n 


10^7 
10A9 


25600 
409600 


5877 
94026 


4780 
76473 


3 
4 


hCMV l=L-gag bGHpA [E3-] -> 


10^7 
10A9 


7352 
235253 


2077 
59767 


1620 
47659 


5 
6 


hCMV I=L-gag SPA [E3+J -» 
■ 


10^7 
10^9 


12800 
310419 


. 9905 
99181 


236 
75165 


7 
8 


'^mCMV FL-gag bGHpA [E3+] 


10^7 
10^ 


44572 
941014 


23504 
239068 


15389 
190636 


9 

10 


*hCMV FL-gag bGHpA [E3-1 <- 
■ 


10^7 


3676 
117627 


934 
17491 


745 
15227 


11 

12 
13 
14 


research lot hCMV intronA FL-gag bGHpA [E3-] <- 

n 

n 
n 


10^ 
10^7 
10^ 
10^ 


528 
14703 
58813 
204800 


262 
5274 
14942 
53232 


175 
3882 
11915 
42250 


15 
16 
17 
18 


Clinical lot hCiViVintronA FL-gag bGHpA [E3-] <- 

a 
a 

B 


10^ 
10^7 
10^ 
10A9 


230 
4222 
19401 
89144 


82 
3405 
3939 
25187 


61 

1138 
3274 
19639 


19 


Narve 1 


none 


93 


7 


6 



*2x50iiLl.m. (quad) injections/anifnal 
P.ls: YouD, Chen, Casimiro 
Vacdnation: T. Toner, Q. Su 
Assay: M. Chan 



*'The stnjcture of MRKAdSgag is: hCMVFL-gagbGHpA [E3+] The same lot of MRKAd5gag 
used in this rodent study was used in the Rhesus monkey study (Tables 7 and 8). 
*^e same tot of mCMVFL-gagbGHpA[E3+] used In the in vitro study (Table 6) ws used here. 
^Is construct was designed by Volker Sandlg. It contains a shorter version of the hOMV 
promoter than that used in the MRK constructs. The adenovector backbone Is identical to the 
original backbone used in the original Adgag vector. Expression at 1 0e7 dose from this vector is 7 
fold lower then the same dose of the MRKAd5gag and 4 fold lower than the research lot. 



EXAMPLE 16 

Comparison of Humoral and Cellular Responses Towards the Original Ad-gag 
Construct with the New MRK Ad5 HIV-1 gag in Rhesus Monkeys 
5 Cohorts of 3 rhesus monkeys were vaccinated intramuscularly with MRK Ad5 

HIV-1 gag or the clinical AdSgag bulk at two doses, lO" vp and 10^ vp. 
Immunizations were conducted at week 0, 4, and 25. Serum and PBMC samples were 
collected at selected time points. The serum sample were assayed for anti-p24 Ab 
titers (using competitive based assay) and the PBMCs for antigen-specific IFN- 
10 gamma secretion following overnight stimulation with gag 20-mer peptide pool (via 
ELISpot assay). 

The results shown in Table 8 indicate comparable responses with respect to 
the generation of anti-gag antibodies. The frequencies of gag-specific T cells in 
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peiipheral blood assummarized in Table 9 demonstrate a strong cellular immune 
response generated after a single dose with the new construct MRK Ad5 HIV-1 gag. 
The responses are also boostable with second dose of the same vector. The vector is 
also able to induce CD8+ T cell responses (as evident by rranaining spot counts after 
CD4+ depletion of PBMCs) which are responsible for cytotoxic activity. 

Table 8 Anti-p24 antibody tita:s (in mMU/mL) in rhesus macaques immunized with 



gag'-expressing adenovectors (Protocol HIV203). 



Vaccine 


Pre 


Wk4 


Wk8 


Wk12 


Wk16 


Wk20 


Wk25 


Wk28 


MRICAd5aQQi''.rO''11 VP 


















97N010 


<10 


118 


5526 


11523 


7062 


21997 


ND 


51593 


97N116 


<10 


62 


772 


1447 


1562 


2174 


ND 


20029 


98X007 


<10 


66 


3353 


6156 


6845 


3719 


ND 


24031 




















MRKAdSgpci. \0''9vt> 


















97N120 


<'10 


61 


204 


318 


366 


462 


ND 


6550 


97N144 


<10 


16 


118 


274 


706 


868 


ND 


7136 


9aX008 


<10 


15 


444 


366 


996 


1072 


ND 


12851 






















^ainfcxlLo».10M1 VP 


















97X001 


<10 


87 


2579 


4718 


7174 


7250 


ND 


69226' 


97N146 


<10 


72 


3604 


7380 


7526 


18906 


ND 


60283 


9flX009 


<10 


78 


4183 


3946 


3124 


6956 


ND 


26226 
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97N020 


<10 


<10 


143 


371 


390 


1821 


ND 


17177 


97X003 


<10 


<10 


39 


93 


156 


596 


ND 


2053 


98X012 


<;1P 


81 


342 


717 


956 


1558 


ND 


11861 
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Table 9. Number of gag-specific T cells per million peripheral blood mononuclear 
cells (PBMCs) in rhesus monkeys immunized with gag-expressing adenovectors. 
Also included axe those frequencies in PBMCs depleted of CD4* T cells. 



GrpI 



Vacdndton 
T=n.d^wlci 



Monkey ID | TeAWlt T 



Ta6Wk 



97ND10 
97N010Cai4-) 

97Nn6 
97Nn6(C04-) 

92X007 
98X007CCD40 



6 
4 

1 

11 
10 
20 



89 
3& 
396 
676 
579 
965 



0 
1 
0 



ecqH 



395 
609 
1304 



Tel 



0 
3 
0 
0 
3 
0 



GOffH 



1058 

993 

S34 

593 

2193 

2675 



0 
4 
1 



GooH 



1174 
39S 
2116 



Ta2SWk 



3 
0 
1 
0 
3 
0 



soon 



775 

76 

261 

164 

1588 

1656 



4 
0 
0 
0 
0 
0 



GoaH 



1074 

594 

408 

666 

2113 

1278 



10^ VP 



97N120 
97N120(qD4-> 

97hn44 
97N144CCD4') 

98X008 
9SX008(CD4-> 



5 
11 
3 
6 
4 
14 



275 
170 
236 
MB 
36B 
696 



1 
S 
1 



240 
438 
1090 



4 
0 
1 
0 
3 
0 



Ml 
85 
318 
285 
891 
1175 



4 
3 
4 



119 
256 
673 



9 
0 
1 

ND 
3 
0 



206 

75 

98 

ND 

473 

391 



10^11 vp 



97XX1 
97X001CCO4.) 

97N146 
97N146CCD4.) 

96X009 
98X009CCD40 



0 

10 

3 

6 

0 

0 



261 
263 
150 
133 
63 
73 



1 
1 
3 



485 
465 
339 



0 
3 
0 
0 
3 
0 



617 
996 
339 
370 
559 
333 



0 
1 

0 



1220b 

1272 

696 



1 
0 
3 
0 
1 

0 



894 

1010 

1238 

654 

384 

225 



4 
1 
5 
0 
5 
4 



0 
0 

3 
0 
0 
0 



219 
219 
373 
625 
735 
648 



1858 

1125 

1785 

971 

1748 

644 



AcSoaodlnladld 
10^ vp 



97N020 
97N020CCD4-) 

97X003 
97XQ03CCO4-) 

96X012 
98XD12CCD4-) 



3 

10 

4 

9 
5 
11 



30 

68 
40 
85 
70 



1 

5 



101 
134 
54 



0 
0 
0 
0 
1 
0 



66 
15 
18 
6 
34 
11 



0 
1 

0 



36 
38 
18 



0 
0 
4 
0 
0 
0 



26 
1 

38 
4 
20 
6 



0 
0 
6 
0 
1 
0 



41 

16 

81 

19 

121 

41 



9aR041 
053F 



6 
14 



8 
18 



1 

5 



1 

16 



0 
20 



0 
14 



0 
19 



0 

IS 



0 
10 



0 
15 



1 

24 



0 
9 
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The adenovectors described herein and, particularly, MRK Ad5 HTV-l gag, 
represent very promising HIV-gag adenovectors with respect to their enhanced 
growth characteristics in both serum and, more importantly, in serum-free media 

10 conditions. In comparison with the current HIV-1 gag adenovector constmct, MRK 
Ad5 mV-l gag shows a 5-10 fold increased amplification rate. We have shown that 
it is genetically stable at passage 21. This construct is able to generate significant 
cellular immune responses in vivo even at a relatively low dose of 10^9 vp. The 
potency of the MElKAdSgag construct is comparable to, if not better than the original 

15 mV-lgag vector as shown in this rfiesus monkey study. 



20 



EXAMPLE 17 

CODON OPTIMIZED mV-l POL AITO CODON OPTIMZED 

mV-l POLMODMCAHONS 
The open reading frames for the various synthetic pol genes disclosed herein 
comprise coding sequences for the reverse transcriptase (or RT which consists of a 
polymerase and RNase H activity) and integrase (EN). The protein sequence is based 
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on that of Hxb2r, a clonal isolate of HEB; this sequence has been shown to be closest 
to the consensus clade B sequence with only 16 nonidentical residues out of 848 
(Koiba:, et al., 1998, Human retroviruses and AIDS, Los Alamos National 
Laboratory, Los Alamos, New Mexico). The skilled artisan will understand after 
5 review of this specification that aay available HIV-1 W HIV-2 strain provides a" 
potential template for the generation of HIV pol DNA vaccine constructs disclosed 
herein. It is further noted that the protease gene is excluded from the DNA vaccine 
constructs of the present invention to insure safety from any residual protease activity 
in spite of mutational inactivation. The design of the gene sequences for both wild- 

10 type (wt-pol) and inactivated pol (lA-poI) incorporates the use of human preferred 
C'humaiuzed") codons for each amino acid residue in the sequence in order to 
maximize in vivo mammalian expression (Lathe, 1985, /. MoL BioL 183:1-12). As 
can-be discerned by inspecting the codon usage in SEQ ID NOs: 1,3,5 and 7, the 
following codon usage for mammalian optimization is preferre± Met (ATG), Gly 

15 (GGC), Lys (AAG), Trp (TGG), Ser (TCC), Arg (AGG), Val (GTG). Pro (CCC), Thr 
(ACQ, Glu (GAG); Leu (CTG), His (CAC), He (ATC), Asn (AAC), Cys (TGC), Ala 
(GCC)i Gto (GAG), Phe (TTC) and Tyr (TAG). For an additional discussion relating 
to mammalian (human) codon optimization, see WO 97/31115 (PCT/US97/02294), 
which, as noted elsewhere in this specification, is hereby incorporated by reference. It 

20 is intended that the skilled artisan may use alternative versions of codon optimization 
or may omit this step when generating HIV pol vaccine constmcts within the scope of 
the present invention. Therefore, the present invention also relates to non^codon 
optimized versions of DNA molecules and associated recombinant adenoviral HIV 
vaccines which encode the various wild type and modified forms of the HEV Pol 

25 protein disclosed herein. However, codon optimization of these constructs is a 
preferred embodiment of this invention. 

A particular embodiment of this portion of the invention comprisies codon 
optimized nucleotide sequences which encode wt-pol DNA constracts (herein, "wt- 
por or "wt-pol (codon optimized))" wherein DNA sequences encoding the protease 

30 (PR) activity are deleted, leaving codon optimized "wild type" sequences which 

encode RT (reverse transcriptase and RNase H activity) and IN integrase activity, A 
DNA molecule which encodes this protein is disclosed herein as SEQ ID NO:l, the 
open reading firame being contained firom an initiating Met residue at nucleotides 10- 
12 to a termination codon from nucleotides 2560-2562. SEQ ID NO; 1 is as follows: 

35 AGATCTACCA TGGCCCCCAT CTCCCCCATT GAGACTGTGC CTGTGAAGCT GAAGCCTGGC 
ATGGATGGCC CCAAGGTGAA GCAGTGGCCC CTGACTGAGG AGAAGATCAA GGCCCTGGTG 
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GAAATCTGCA CTGAGATGO?^ GAAGGAGGGC 
TACAACACCC CTGTGTTTGC CATCAAGAAG 
GACTTCAGG6 AGCTGAACAA GAGGACCCA6 
CACCCCGCTG GCCTGAAGAA GAA6AAGTCT 
5 TTCTCTGTGC CCCTGGATGA GGACTTCAGG 
AACAATGAGA CCCCTGGCAT CAGGTACCA6 
TCCCCTGCCA TCTTGCAGTC CTCCATGACC 
CCTGACATTG TGATCTACCA GTACATGGAT 
GGGCAGCACA GGACCAAGAT TGAGGAGCTG 

10 ACCCCTGACA AGAAGCACCA GAAGGAGCCC 
CCCGACAAGT GGACTGTGCA GCCCATTGTG 
GACATCCAGA AGCTGGTGGG CAAGCTGAAC 
GTGAGGCAGC TGTGCAAGCT GCTGAGGG6C 
ACTGAGGAGG CTGAGCTGGA GCTGGCTGAG 

15 GGGGTGTACT ATGACCCCTC CAAGGACCTG 
CAGTGGACCT ACCAAATCTA CCAGGAGCCC 
A6GAT6AGGG GGGCCCACAC CAATGATGTG 
ACCACTGAGT CCATTGTGAT CTGGGGCAAG 
GAGACCTGGG AGACCTGGTG GACTGAGTAC 

20 TTTGT6AACA CCCCCCCCCT GGTGAAGCTG 
GGGGCTGAGA CCTTCTATGT GGATGGGGCT 
GGCTATGTGA CCAACAGGGG CAGGCAGAAG 
AAGACTGAGC TCCAGGCCAT CTACCTGGCC 
GTGACTGACT CCCAGTATGC CCTGGGCATC 

25 GAGCTGGTGA ACCAGATCAT TGAGGAGCTG 
GTGCCTGCCC ACAAGGGCAT TGGGGGCAAT 
ATCAGGAAGG TGCTGTTCCT GGATGGCATT 
CACTCCAACT GGAGGGCTAT GGCCTCTGAC 
ATTGTGGCCT CCTGTGACAA GTGCCAGCTG 

30 TGCTCCCCTG GCATCTGGCA GCTGGACTGC 
GCTGTGCATG TGGCCTCCGG CTACATTGAG 
GAGACTGCCT ACTTCCTGCT GAAGCTGGCT 
6ACAATGGCT CCAACTTCAC TGGGGCCACA 
AAGCAGGAGT TTGGCATCCC CTACAACCCC 

35 AAGGAGCTGA AGAAGATCAT TGGGCAGGTG 
GTGCAGATGG CTGTGTTCAT CCACAACTTC 



AAAATCTCCA AGATTGGCCC CGAGAACCCC 
AAGGACTCCA CCAAGTGGAG GAAGCTGGTG 
GACTTCTGGG AGGTGCAGCT GGGCATCCCC 
GTGACTGTGC TGGATGTGGG GGATGCCTAC 
AAGTACACTG CCTTCACCAT CCCCTCCATC 
TACAATGTGC TGCCCCAGGG CTGGAAGGGC 
AAGATCCTGG AGCCCTTCAG GAAGCAGAAC 
GACCTGTATG TGGGCTCTGA CCTGGAGATT 
AGGCAGCACC TGCTGAGGTG GGGCCTGACC 
CCCTTCCTGT GGATGGGCTA TGAGCTGCAC 
CTGCCTGAGA AGGACTCCTG GACTGTGAAT 
TGGGCCTCCC AAATCTACCC TGGCATCAAG 
ACCAAGGCCC TGACTGAGGT GATCCCCCTG 
AACAGGGAGA TCCTGAAGGA GCCTGT6CAT 
ATTGCTGAGA TCCAGAAGCA GGGCCAGGGC 
TTCAAGAACC TGAAGACTGG CAAGTATGCC 
AAGGAGCTGA CTGAGGCT6T GCAGAAGATC 
ACCCCCAAST TCAAGCTGCC CATCCAGAAG 
TGGCAGGCCA CCTGGATCCC TGAGTGGGAG 
TGGTACCAGC TGGA6AAGGA GCCCATTGTG 
GCCAACAGGG AGACCAAGCT GGGCAAGGCT 
GTGGTGACCC TGACTGACAC CACCAACCAG 
CTCCAGGACT CTGGCCTGGA GGTGAACATT 
ATCCAGGCCC AGCCTGATCA GTCTGAGTCT 
ATCAAGAAGG AGAAGGTGTA CCTGGCCTGG 
GAGCAGGTGG ACAAGCTGGT GTCTGCTGGC 
GACAAG6CCC AGGATGAGCA TGAGAAGTAC 
TTCAACCTGC CCCCTGTGGT GGCTAAGGAG 
AAGGGGGAGG CCATGCATGG GCAGGTGGAC 
ACCCACCTGG A6GGCAAGGT GATCCTGGTG 
GCTGAGGTGA TCCCTGCTGA GACAGGCCAG 
GGCAGGTGGC CTGTGAAGAC CATCCACACT 
GTGAGGGCTG CCTGCTGGTG GGCTGGCATC 
CAGTCCCAGG GGGTGGTGGA GTCCATGAAC 
AGGGACCAGG CTGAGCACCT GAAGACAGCT 
AAGAGGAAGG 6GG6CATCGG GGGCTACTCC 
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GCTGGGGA6A GGATTGTGGA CATCATTGCC ACAGACATCC ACSACCAAGGA GCTCCAGAAG 
CAGATCACCA AGATCCAGAA CTTCAGGGTG TACTACAGGG ACTCCAGGAA CCCCCTGTGG 
AAGGGCCCTG CCAAGCTGCT GTGGAAGGGG. GAGGGGGCTG TGGTGATCCA 6GACAACTCT 
GACATCAAGG TGGTGCCCA6 GAGGAAGGCC AAGATCATCA GGGACTATGG CAAGCAGATG 
5 GCT6GGGATG ACTGTGTGGC CTCCAGGCAG GATGAGGACT AAAGCCCGGG CAGATCT (SEQ 
ID N0:1) . 

The open reading frame of the wild type pol construct disclosed as SEQ ID 
N0:1 contains 850 amino acids, disclosed herein as SEQ ID N0:2, as follows: 

Met Ala Pro He Ser Pro He Glu Thr Val Pro Val Lys Leu I»ys Pro 

10 Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro Leu Thr Glu Glu Lys 
He Lys Ala Leu Val Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys 
He Ser Lys He Gly Pro Glu Asn Pro Tyr Asn Thr Pro Val Phe Ala 
He Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg 
Glu Leu Asn Lys Arg Thr Gin Asp Phe Trp Glu Val Gin Leu Gly He 

15 Pro His Pro Ala Gly Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp 
Val Gly Asp Ala Tyr Phe Ser Val Pro Leu Asp Glu Asp Phe Arg Lys 
Tyr Thr Ala Phe Thr He Pro Ser He Asn Asn Glu Thr Pro Gly He 
Arg Tyr Gin Tyr Asn Val Leu Pro Glu Gly Trp Lys Gly Ser Pro Ala 
He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro Phe Arg Lys Gin 

20 Asn Pro Asp He Val He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly 
Ser Asp Leu Glu He Gly Gin His Arg Thr Lys He Glu Glu Leu Arg 
Gin His Leu Leu Arg Trp Gly Leu Thr Thr Pro Asp Lys Lys His Gin 
Lys Glu Pro Pro Phe Leu Trp Met Gly Tyr Glu Leu His Pro Asp Lys 
Trp Thr Val Gin Pro He Val Leu Pro Glu Lys Asp Ser Trp Thr Val 

25 Asn Asp He Gin Lys Leu Val Gly Lys Leu Asn Trp Ala Ser Gin He 
Tyr Pro Gly He Lys Val Arg Gin Leu Cys Lys Leu Leu Arg Gly Thr 
Lys Ala Leu Thr Glu Val He Pro Leu Thr Glu Glu Ala Glu Leu Glu 
Leu Ala Glu Asn Arg Glu He Leu Lys Glu Pro Val His Gly Val Tyr 
Tyr Asp Pro Ser Lys Asp Leu He Ala Glu He Gin Lys Gin Gly Gin 

30 Gly Gin Trp Thr Tyr Gin He Tyr Gin Glu Pro Phe Lys Asn Leu Lys 
Thr Gly Lys" Tyr Ala Arg Met Arg Gly Ala His Thr Asn Asp Val Lys 
Gin Leu Thr Glu Ala Val Gin Lys He Thr Thr Glu Ser He Val He 
Trp Gly Lys Thr Pro Lys Phe Lys Leu Pro He Gin Lys Glu Thr Trp 
Glu Thr Trp Trp Thr Glu Tyr Trp Gin Ala Thr Trp He Pro Glu Trp 

35 Glu Phe Val Asn Thr Pro Pro Leu Val Lys Leu Trp Tyr Gin Leu Glu 
Lys Glu Pro He Val Gly Ala Glu Thr Phe Tyr Val Asp Gly Ala Ala 
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Asn 


Arg 


Glu Thr Lys Leu Gly 


Lys Ala Gly 


Tyr Val 


Thr Asn Arg 


Gly 




Arg 


Gin 


Lys Val Val Thr Leu 


Thr Asp Thr 


Thr 


Asn 


Gin 


Lys Thr 


Glu 




Leu 


Gin 


Ala lie Tyr Leu Ala 


Leu Gin Asp 


Ser Gly 


Leu 


Glu Val 


Asn 




lie 


Val 


Thr Asp Ser Gin Tyr 


Ala Leu Gly 


lie 


He 


Gin 


Ala Gin 


Pro 


5 


Asp 


Gin 


Ser Glu Ser Glu Leu 


Val Asn Gin 


He 


He 


Glu 


Gin Leu 


He 




Lys 


Lys 


Glu Lys Val Tyr Leu 


Ala Trp Val 


Pro 


Ala 


His 


Lys Gly 


He 




Gly 


Gly 


Asn Glu Gin Val Asp 


Lys Leu Val 


Ser 


Ala 


Gly 


He Arg 


Lys 




Val 


Leu 


Phe Leu Asp Gly lie 


Asp Lys Ala 


Gin Asp 


Glu 


His Glu 


Lys 




Tyr 


His 


Ser Asn Trp Arg Ala 


Met Ala -Ser 


Asp 


Phe 


Asn 


Leu Pro 


Pro 


10 


Val 


Val 


Ala Lys Glu lie Val 


Ala Ser Cys 


Asp 


Lys 


Cys 


Gin Leu 


Lys 




Gly 


Glu 


Ala Met His Gly Gin 


Val Asp Cys 


Ser 


Pro 


Gly 


He Trp 


Gin 




Leu 


Asp 


Cys Thr His Leu Glu 


Gly Lys Val 


He 


Leu 


Val 


Ala Val 


His 




Val 


Ala 


Ser Gly Tyr lie Glu 


Ala Glu Val 


He 


Pro 


Ala 


Glu Thr 


Gly 




Gin 


Glu 


Thr Ala Tyr Phe Leu 


Leu Lys Leu 


Ala Gly 


Arg 


Trp Pro 


Val 


15 


Lys 


Thr 


lie His Thr Asp Asn 


Gly Ser Asn 


Phe 


Thr 


Gly Ala Thr 


Val 


» 


Arg 


Ala 


Ala Cys Trp Trp Ala 


Gly lie Lys 


Gin 


Glu 


Phe Gly He 


Pro 




Tyr 


Asn 


Pro Gin Ser Gin Gly 


Val Val Glu 


Ser 


Met 


Asn 


Lys Glu 


Leu 




Lys 


Lys 


lie lie Gly Gin Val 


Arg Asp Gin 


Ala 


Glu 


His 


Leu Lys 


Thr 




Ala 


Val 


Gin Met Ala Val Phe 


He His Asn 


Phe 


Lys 


Arg Lys Gly 


Gly 


20 


lie 


Gly 


Gly Tyr Ser Ala Gly 


Glu Arg He 


Val 


Asp 


He 


lie Ala 


Thr 




Asp 


lie 


Gin Thr Lys Glu Leu 


Gin Lys Gin 


He 


Thr 


hys 


He Gin 


Asn 




Phe 


Arg 


Val Tyr Tyr Arg Asp 


Ser Arg Asn 


Pro 


Leu 


Trp 


Lys Gly 


Pro 




Ala 


Lys 


Leu Leu Trp Lys Gly 


Glu Gly Ala 


Val 


Val 


He 


Gin Asp 


Asn 




Ser 


Asp 


lie Lys Val Val Pro 


Arg Arg Lys 


Ala 


Lys 


He 


He Arg 


Asp 


25 


Tyr 


Gly 


Lys Gin Met Ala Gly 


Asp Asp Cys 


Val 


Ala 


Ser 


Arg Gin 


Asp 




Glu 


Asp 


(SEQ ID NO: 2) . 















The present invention especially relates to an adenoviral vector vaccine which 
comprises a codon optimized HTV-l DNA pol construct wherein, in addition to 
deletion of the portion of the wild type sequence encoding the protease activity, a 

30 combination of active site residue mutations are introduced which are deleterious to 
HIV-1 pol (RT-RH-IN) activity of the expressed protein. Therefore, the present 
invention preferably relates to an adenoviral HIV-1 DNA pol-based vaccine wherein 
the construct is devoid of DNA sequences encoding any PR activity, as well as 
containing a mutation(s) which at least partially, and preferably substantially, 

35 abolishes RT, RNase and/or IN activity. One type of HIV-1 pol mutant which is part 
and parcel of an adenoviral vector vaccine may include but is not limited to a mutated 

59 



wo 02/22080 



PCT/USOl/28861 



DNA molecule comprising at least one nucleotide substitution which results in a point 
mutation which effectively alters an active site within the RT, RNase and/or IN 
regions of the expressed protein, resulting in at least substantially decreased 
. enzymatic activity for the RT, RNase H and/or IN functions of HIV-l Pol In a 
preferred embodiment of this portion of the invention, a HIV-l DNA pol construct 
contains a mutation or mutations within the Pol coding region which effectively 
abolishes RT, RNase H and IN activity. An especially preferable HIV-l DNA pol 
construct in a DNA molecule which contains at least one point mutation which alters 
the active site of the RT, RNase H and IN domains of Pol, such that each activity is at 
least substantially abolished. Such a HIV-l Pol mutant will most likely comprise at 
least one point mutation in or around each catalytic domain responsible for RT, 
RNase H and IN activity, respectfully. To this end, an especially preferred HIV-l 
DNA pol constmct is exemplified herein and contains nine codon substitution 
mutations which results in an inactivated Pol protein (lA Pol: SEQ ID NO:4, Figure 
17A-C) which has no PR, RT, RNase or IN activity, wherein three such point 
mutations reside within each of flie RT, RNase and IN catalytic domains. Therefore, 
an especially preferred exemplification is an adenoviral vaccine which comprises, in 
an appropriate fashion, a DNA molecule which encodes lA-pol, which contains aU 
nine mutations as shown below in Table 1. An additional preferred amino acid 
residue for substitution is AspSSl, localized within the RNase domain of Pol. Any 
combination of the mutations disclosed herein may suitable and therefore may be 
utilized as an lA-Pol-based vaccine of the present invention. While addition and 
deletion mutations are contemplated and within the scope of the invention, the 
preferred mutation is a point mutation resulting in a substitution of the wild type 
amino acid with an alternative amino acid residue. 



Table 1 



wtaa 


aa residue 


mutant aa 


enzvme function 


Asp 


112 


Ala 


RT 


Asp 


187 


Ala 


RT 


Asp 


188 


Ala 


RT 


Asp • 


445 


Ala 


RNase H 


Glu 


480 


Ala 


RNase H 


Asp 


500 


Ala 


RNase H 


Asp 


626 


Ala 


IN 


Asp 


678 


Ala 


IN 


Glu 


714 


Ala 


IN 
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It is preferred that point mutations be incorporated into the lApol mutant adenoviral 
vaccines of the present invention so as to lessen the possibility of altering epitopes in 
and around the active site(s) of HIV- 1 Pol. 

To this end, SEQ ID N0:3 discloses the nucleotide sequence which codes for 
5 a codon optimized pol in addition to the nine mutations shown in Table 1, disclosed as 
follows, and referred to herein as "lApol": 





AGATCTACCA TGGCCCCCAT 


CTCCCCCATT 


GAGACTGTGC 


CTGTGAAGCT 


GAAGCCTGGC 




AT66ATG6CC CCAA66TGAA 


GCAGTGGCCC 


CTGACTGAGG 


AGAAGATCAA 


GGCCCTGGTG 


• 


GAAATCTGCA CTGAGATGGA 


GAAGGAGGGC 


AAAATCTCCA 


AGATTGGCCC 


CGAGAACCCC 


10 

1 


TACAACACCC CTGTGTTTGC 


CATCAAGAAG 


AAGGACTCCA 


CCAAGTGGAG 


GAAGCTGGT6 




GACTTCAGGG AGCTGAACAA 


6AGGACCCAG 


GACTTCTGGG 


AGGTGCAGCT 


GGGCATCCCC 




CACCCCGCTG GCCTGAAGAA 


GAAGAAGTCT 


GTGACTGTGC 


TGGCTGTGGG 


GGATGCCTAC 




TTCTCTGTGC CCCTGGATGA 


GGACTTCAGG AAGTACACTG 


CCTTGACCAT 


CCCCTCCATC 




AACAATGAGA CCCCTGGCAT 


CAGGTACCAG 


TACAATGTGC 


TGCCCCAGGG 


CTGGAAGGGC 


15 


TCCCCTGCCA TCTTCCAGTC 


CTCCATGACC 


AAGATCCTGG 


AGCCCTTCAG 


GAAGCAGAAC 




CCTGACATTG TGATCTACCA 


GTACATGGCT 


GCCCTGTATG 


TGGGCTCTGA 


CCTGGAGATT 




GGGCAGCACA GGACCAAGAT 


TGAGGAGCTG AGGCAGCACC 


TGCTGAGGTG 


GGGCCTGACC 




ACCCCTGACA AGAAGCACCA 


GAAGGAGCCC 


CCCTTCCTGT 


GGATGGGCTA 


TGAGCTGCAC 




CCCGACAAGT GGACTGTGCA 


GCCCATTGTG 


CTGCCTGAGA 


AGGACTCCTG 


GACTGTGAAT 


20 


GACATCCA6A AGCTGGTGGG 


CAAGCTGAAC 


TGGGCCTCCC 


AAATCTACCC 


T6GCATCAAG 




GTGAGGCAGC TGTGCAAGCT 


GCTGAGGGGC 


ACCAAGGCCC 


TGACTGAGGT 


GATCCCCCTG 




ACTGAGGAGG CTGAGCTGGA 


GCTGGCTGAG AACAGGGAGA 


TCCTGAAGGA 


GCCTGTGCAT 




GGGGTGTACT ATGACCCCTC 


CAAGGACCTG ATTGCTGAGA 


TCCAGAAGCA 


GGGCCAGGGC 




CAGTGGACCT ACCAAATCTA 


CCA6GAGCCC 


TTCAAGAACC 


TGAAGACTGG 


CAAGTATGCC 


25 


AGGATGAGGG GGGCCCACAC 


CAATGATGTG 


AAGCAGCTGA 


CTGAGGCTGT 


GCAGAAGATC 




ACCACTGAGT CCATTGTGAT 


CTG6GGCAAG 


ACCCCCAAGT 


TCAAGCTGCC 


CATCCAGAAG 




GAGACCTGGG AGACCTGGTG 


GACTGAGTAC 


TGGCAGGCCA 


CCTGGATCCC 


TGAGTGGGAG 




TTTGTGAACA CCCCCCCCCT 


GGTGAAGCTG 


TGGTACCAGC 


TGGAGAAGGA 


GCCCATTGTG 




GGGGCTGAGA CCTTCTATGT 


GGCTGGGGCT 


GCCAACAGGG 


AGACCAAGCT 


GGGCAAGGCT 


30 


GGCTATGTGA CCAACAGGGG 


CAGGCAGAAG 


GTGGTGACCC 


TGACTGACAC 


CACCAACCAG 




AAGACTGCCC TCCAGGCCAT- 


CTACCTGGCC 


CTCCAGGACT 


CTGGCCTGGA 


GGTGAAC ATT 




GT6ACTGCCT CCCAGTATGC 


CCTGGGCATC 


ATCCAGGCCC 


AGCCTGATCA 


GTCTGAGTCT 




GAGCTGGTGA ACCAGATCAT 


TGAGGAGCTG ATCAAGAAGG 


AGAAGGTGTA 


CCTGGCCTGG 




GTGCCTGCCC ACAAGGGCAT 


TGGGGGCAAT 


GAGCAGGTGG 


ACAAGCTGGT 


GTCTGCTGGC 


35 


ATCAGGAAGG TGCTGTTCCT 


GGATGGCATT 


GACAAGGCCC 


AGGATGA6CA 


TGAGAA6TAC 




CACTCCAACT GGAGGGCTAT 


GGCCTCTGAC 


TTCAACCTGC 


CCCCTGTGGT 


GGCTAAGGAG 
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ATTGTG6CCT CCTGTGACAA GTGCCAGCT6 AAGG6GGAGG CCATGCATGG GCAGGTGGAC 
TGCTCCCCTG GCATCTGGCA GCTGGCCTGC ACCCACCTGG AGGGCAAGGT GATCCTGGTG 
GCTGTGCATG TGGCCTCCGG CTACATTGAG GCTGAGGTGA TCCCTGCTGA GACAGGCCAG 
GAGACTGCCT ACTTCCTGCT GAAGCTGGCT GGCAGGTGGC CTGT6AAGAC CATCCACACT 
5 GCCAATGGCT ^CCAACTTCAC TGGGGCCACA GTGAGGGCTG CCTGCTGGTG GGCTGGCATC 
AAGCAGGAGT TTGGCATCCC CTACAACCCC CAGTCCCAGG GGGTGGTGGC CTCCATGAAC 
AAGGAGCTGA AGAAGATCAT TGGGCAGGTG AGGGACCAGG CTGAGCACCT GAAGACAGCT 
GTGCA6ATGG CTGTGTTCAT CCACAACTTC AAGAGGAAGG GGGGCATCGG GGGCTACTCC 
- GCT6GGGAGA GGATTGTGGA CATCATTGCC ACAGACATCC AGACCAAGGA GCTCCAGAAG 
10 CAGATCACCA AGATCCAGAA CTTCAGGGTG TACTACAGGG ACTCCAGGAA CCCCCTGTGG 
AAGGGCCCTG CCAAGCTGCT GTGGAAGGGG GAGGGGGCTG TGGTGATCCA GGACAACTCT 
GACATCAAGG TGGTGCCCAG GAGGAAGGCC AAGATCATCA GGGACTATGG CAAGCAGATG 
GCTGGGGATG ACTGTGTGGC CTCCAGGCAG GATGAGGACT AAAGCCCGGG CAGATCT (SEQ ID 
N0:3)- 

15 In order to produce the lA-pol-based adenoviral vaccines of the present 

invention, inactivation of the enzymatic functions was achieved by replacing a total of 
nine active site residues from the enzyme subunits with alanine side-chains. As 
shown in Table 1, all residues that comprise the catalytic triad of the polymerase, 
namely Aspll2, Aspl87, and AsplSS, were substituted with alanine (Ala) residues 

20 (Larder, et al., Nature 1987, 327: 716-717; Larder, et al., 1989. Proc. NatL Acad. ScL 
1989, 86: 4803-4807). Three additional mutations were introduced at Asp445, 
GIu480 and Asp500 to abolish RNase H activity (AspSSl was left unchanged in this 
lA Pol construct), with each residue being substituted for an Ala residue, respectively 
(Davies, et al., 1991, Science 252:, 88-95; Schatz, et al., 1989, FEES Lett. 257: 311- 

25 314; Mizrahi, et al., 1990, Nucl. Adds. Res. 18: pp. 5359-5353). HIV pol integrase 
function was abolished through three mutations at Asp626, Asp678 and Glu714. 
Again, each of these residues has been substituted with an Ala residue (Wiskerchen, 
et al., 1995, J. Virol. 69: 376-386; Leavitt, et al., 1993, J. Biol. Chem. 268: 2113- 
2119). Amino acid residue Pro3 of SEQ ID NO:4 marks the start of the RT gene. 

30 The complete amino add sequence of lA-Pol is disclosed herein as SEQ ID NO:4 and 
Figure 17A-C, as follows: 

Met Ala Pro lie Ser Pro He Glu Thr Va.1 Pro Val Lys Leu Lys Pro 
Gly Met Asp Gly Pro Lys Val Lys Gin Trp Pro Leu Thr Glu Glu Lys 
lie Lys Ala Leu Val Glu He Cys Thr Glu Met Glu Lys Glu Gly Lys 
35 He Ser Lys He Gly Pro Glu Asn Pro Tyr Asn Thr Pro Val Phe Ala 
He Lys Lys Lys Asp Ser Thr Lys Trp Arg Lys Leu Val Asp Phe Arg 
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.Glu Leu Asn hys Arg Thr Gin 
Pro His Pro Ala Gly Leu Lys 
Val Gly Asp Ala Tyr Phe Ser 
Tyr Thr Ala Phe Thr He Pro 
5 Arg Tyr Gin Tyr Asn Val Leu 
He Phe Gin Ser Ser Met Thr 
Asn Pro Asp He Val He Tyr 
Ser Asp Leu Glu He Gly Gin 
Gin His Leu Leu Arg Trp Gly 

10 Lys Glu Pro Pro Phe Leu Trp 
Trp Thr Val Gin Pro He Val 
Asn Asp He Gin Lys Leu Val 
Tyr Pro Gly He Lys Val Arg 
Lys Ala Leu Thr Glu Val He 

15 Leu Ala Glu Asn Arg Glu He 
Tyr Asp Pro Ser Lys Asp Leu 
Gly Gin Trp Thr Tyr Gin He 
Thr Gly Lys Tyr Ala Arg Met 
Gin Leu Thr Glu Ala Val Gin 

20 Trp Gly Lys Thr Pro Lys Phe 
Glu Thr Trp Trp Thr Glu Tyr 
Glu Phe Val Asn Thr Pro Pro 
Lys Glu Pro He Val Gly Ala 
Asn Arg Glu Thr Lys Leu Gly 

25 Arg Gin Lys Val Val Thr Leu 
Leu Gin Ala He Tyr Leu Ala 
He Val Thr Ala Ser Gin Tyr 
Asp Gin Ser Glu Ser Glu Leu 
Lys Lys Glu Lys Val Tyr Leu 

30 Gly Gly Asn Glu Gin Val Asp 
- Val. Leu Phe Leu Asp Gly He 
Tyr His Ser Asn Trp Arg Ala 
Val Val Ala Lys Glu He Val 
Gly Glu Ala Met His Gly Gin 

35 Leu Ala Cys Thr His Leu Glu 
Val Ala Ser Gly Tyr He Glu 



Asp Phe Trp Glu Val Gin Leu Gly He 
Lys Lys Lys Ser Val Thr Val Leu Ala 
Val Pro Leu Asp Glu Asp Phe Arg Lys 
Ser He Asn Asn Glu Thr Pro Gly He 
Pro Gin Gly Trp Lys Gly Ser Pro Ala 
Lys He Leu Glu Pro Phe Arg Lys Gin 
Gin Tyr Met Ala Ala Leu Tyr Val Gly 
His Arg Thr Lys He Glu Glu Leu Arg 
Leu Thr Thr Pro Asp Lys Lys His Gin 
Met Gly Tyr Glu Leu His Pro Asp Lys 
Leu Pro Glu Lys Asp Ser Trp Thr Val 
Gly Lys Leu Asn Trp Ala Ser Gin He 
Gin Leu Cys Lys Leu Leu Arg Gly Thr 
Pro Leu Thr Glu Glu Ala Glu Leu Glu 
Leu Lys Glu Pro Val His Gly Val Tyr 
He Ala Glu He Gin Lys Gin Gly Gin 
Tyr Gin Glu Pro Phe Lys Asn Leu Lys 
Arg Gly Ala His Thr Asn Asp Val Lys 
Lys He Thr Thr Glu Ser He Val He 
Lys Leu Pro He Gin Lys Glu Thr Trp 
Trp Gin Ala Thr Trp He Pro Glu Trp 
Leu Val Lys Leu Trp Tyr Gin Leu Glu 
Glu Thr Phe Tyr Val Ala Gly Ala Ala 
Lys Ala Gly Tyr Val Thr Asn Arg Gly 
Thr Asp Thr Thr Asn Gin Lys Thr Ala 
Leu Gin Asp Ser Gly Leu Glu Val Asn 
Ala Leu Gly He He Gin Ala Gin Pro 
Val Asn Gin He He Glu Gin Leu He 
Ala Trp Val Pro Ala His Lys Gly lie 
Lys Leu Val Ser Ala Gly He Arg Lys 
Asp Lys Ala Gin Asp Glu His Glu Lys 
Met Ala Ser Asp Phe Asn Leu Pro Pro 
Ala Ser Cys Asp Lys Cys Gin Leu Lys 
Val Asp Cys Ser Pro Gly He Trp Gin 
Gly Lys Val He Leu Val Ala Val His 
Ala Glu Val He Pro Ala Glu Thr Gly 
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Gin Glu Thr Ala Tyr Phe Leu Leu Lys Leu Ala Gly Arg Trp Pro Val 
Lys Thr lie His Thr Ala Asn Gly Ser Asn Phe Thr Gly Ala Thr Val 
Arg Ala Ala Cys Trp Trp Ala Gly lie Lys Gin Glu Phe Gly lie Pro 
Tyr Asn Pro Gin Ser Gin Gly Val Val Ala Ser Met Asn Lys Glu Leu 
S Lys Lys lie lie Gly Gin Val Arg Asp Gin Ala Glu His Leu Lys Thr 
Ala Val Gin Met Ala Val Phe He His Asn Phe Lys Arg Lys Gly Gly 
He Gly Gly Tyr Ser Ala Gly Glu Arg He Val Asp He He Ala Thr 
Asp He Gin Thr Lys Glu Leu Gin Lys Gin He Thr. Lys He Gin Asn 
Phe Arg Val Tyr Tyr Arg Asp Ser Arg Asn Pro Leu Trp Lys Gly Pro 
10 Ala Lys Leu Leu Trp Lys Gly Glu Gly Ala Val Val He Gin Asp Asn 
Ser Asp He Lys Val Val Pro Arg Arg Lys Ala Lys He He Arg Asp 
Tyr Gly Lys Gin Met Ala Gly Asp Asp Cys Val Ala Ser Arg Gin Asp 
Glu Asp (SEQ ID NO: 4) . 

As noted above, it will be understood that any combination of the mutations 
15 disclosed above may be suitable. and therefore be utilized as an lA-pol-based 

adenoviral HIV vaccine of the present invention, either when administered alone or in 
a combined modality regime and/or a prime-boost regimen. For example, it may be - 
possible to mutate only 2 of the 3 residues within the respective reverse transcriptase, 
RNase-H, and integrase coding regions while still abolishing these enzymatic 
20 activities. However, the lA-pol construct described above and disclosed as SEQ ID 

m 

N0:3, as well as the expressed protein (SEQ ID NO:4;) is preferred. It is also 
preferred that at least one mutation be present in each of the three catalytic domains. 

Another aspect of this portion of the invention are codon optinrized HIV-1 
Pol-based vaccine constructions which comprise a eukaryotic trafficking signal 

25 peptide such as from tPA (tissue-type plasminogen activator) or by a leader peptide 
such as is found in highly expressed mammalian proteins such as immunoglobulin 
leader peptides. Any functional leader peptide may be tested for efficacy. However, 
a preferred embodiment of the present invention, as with HIV-1 Nef constructs shown 
herein, is to provide for a HIV-1 Pol mutant adenoviral vaccine construction wherein 

30 the pol coding region or a portion thereof is operati vely linked to a leader peptide, 

r 

preferably a leader peptide from human tPA. In other words, a codon optimized 
HIV-1 Pol mutant such as lA-Pol (SEQ ID N0:4) may also comprise a leader peptide 
at the amino terminal portion of the protein, which may effect cellular trafficking and 
hence, immunogenicity of the expressed protein within the host cell. As noted in 
35 Figure 16A-B, a DNA vector which may be utilized to practice the present mvention 
may be modified by known recombinant DNA methodology to contain a leader signal 
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peptide of interest, such that downstream cloning of the modified HTV-l protein of 
interest results in a nucleotide sequence which encodes a modified HIV-1 tPA/Pol 
protein. In the alternative, as noted above, insertion of a nucleotide sequence which 
mcodes a leader peptide may be inserted into a DNA vector housing the open reading 
5 frame for the Pol protein of interest. Regardless of the cloning strategy, the end result 
is a polynucleotide vaccine which comprises vector components for effective gene 
expression in conjunction with nucleotide sequences which encode a modified HIV-1 
Pol protein of interest, including but not limited to a HIV-1 Pol protein which 
contains a leader peptide. The amino acid sequence of the human tPA leader utilized 
10 herein is as foUows: MDAMKRGLCCVLLLCGAVFVSPSEISS (SEQ ID NO: 17). 

r 

Therefore, another aspect of the present invention is to generate HIV-1 Pol-based 
vaccine constructions which comprise a eukaryotic trafficking signal peptide such as 
from tPA. To this end, the present invention relates to a DNA molecule which 
encodes a codon optimized wt-pol DNA construct wherein die protease (PR) activity 

15 is deleted and a human tPA leader sequence is fused to the 5' end of the coding region. 
A DNA molecule which encodes this protein is disclosed herein as SEQ ID NO:5, the 
open reading frame disclosed herein as SEQ ID N0:6. 

To this end, the present invention relates to a DNA molecule which encodes a 
codon optimized wt-pol DNA construct wherein the protease (PR) activity is deleted 

20 and a human tPA leader sequence is fused to the 5* end of the coding region ( herein, 
"tPA-wt-pol"). A DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO:5, the open reading frame being contained from an initiating Met residue 
at nucleotides 8-10 to a termination codon from nucleotides 2633-2635. SEQ ID 
N0:5 is as follows: . 

25 GATCACCATG GATGCAATGA AGAGAGGGCT CTGCTGTGTG CTGCTGCTGT GTGGAGCAGT 
CTTCGTTTCG CCCAGCGAGA TCTCCGCCCC CATCTCCCCC ATTGAGACTG TGCCTGTGAA 
GCTGAAGCCT GGCATGGATG GCCCCAAGGT GAAGCAGTGG CCCCTGACTG AGGAGAAGAT 
CAAGGCCCTG GTGGAAATCT GCACTGAGAT GGA6AAGGA6 GGCAAAATCT CCAAGATTGG 
-CCCCGAGAAC CCCTACAACA CCCCTGTGTT TGCCATCAAG AAGAAGGACT CCACCAAGTG 

30 6AGGAAGCTG GTGGACTTCA GGGAGCTGAA CAAGAGGACC CAGGACTTCT GGGAGGTGCA 
GCTGGGCATC CCCCACCCCG CTGGCCTGAA GAAGAAGAAG TCTGTGACTG TGCTGGATGT 
GGGGGATGCC TACTTCTCTG TGCCCCTGGA TGAGGACTTC AGGAAGTACA CTGCCTTCAC 
CATCCCCTCC ATCAACAATG AGACCCCTGG CATCAGGTAC CAGTACAATG TGCTGCCCCA 
GGGCTGGAAG GGCTCCCCTG CCATCTTCCA GTCCTCCATG ACCAAGATCC TGGAGCCCTT 

.35 CAGGAAGCAG AACCCTGACA TTGTGATCTA CCAGTACATG GATGACCTGT ATGTGGGCTC 
TGACCTGGAG ATTGGGCAGC ACAGGACCAA GATTGAGGAG CTGAGGCAGC ACCTGCTGA6 
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GTGGGGCCTG ACCACCCCTG ACAAGAAGCA CCAGAAGGAG CCCCCCTTCC TGTGGATGGG 
CTATGAGCTG CACCCCGACA AGTGGACTGT GCAGCCCATT GTGCTGCCTG AGAAG6ACTC . 
CTGGACTGTG AATGACATCC AGAAGCTGGT GGGCAAGCTG AACTGGGCCT CCCAAATCTA 
CCCTGGCATC AAGGTGAGGC AGCTGTGCAA GCTGCTGAGG GGCACCAAGG CCCTGACTGA 
5 GGTGATCCCC CTGACTGAGG AGGCTGAiGCT GGAGCTGGCT GAGAACAGGG AGATCCTGAA 
GGAGCCTGTG CATGGGGTGT ACTATGACCC CTCCAAGGAC CTGATTGCTG AGATCCAGAA 
GCAGGGCCAG GGCCAGTGGA CCTACCAAAT CTACCAGGAG CCCTTCAAGA ACCTGAAGAC 
TGGCAAGTAT GCCAGGATGA GGGGGGCCCA CACCAATGAT GTGAAGCAGC TGACTGAGGC 
TGTGCAGAAG ATCACCACTG AGTCCATTGT GATCTGGGGC AAGACCCCCA AGTTCAAGCT 

10 GCCCATCCAG AAGGAGACCT GGGAGACCTG GTGGACTGAG TACTGGCAGG CCACCTGGAT 
CCCTGAGTGG GAGTTTGTGA ACACCCCCCC CCTGGTGAAG CTGTGGTACC AGCTGGAGAA 
GGAGCCCATT GTGGGGGCTG AGACCTTCTA TGTGGATGGG GCTGCCAACA GGGAGACCAA 
GCTGGGCAAG GCTGGCTATG TGACCAACAG GGGCAGGCAG AAGGTGGTGA CCCTGACTGA 
CACCACCAAC CAGAAGACTG AGCTCCAGGC CATCTACCTG GCCCTCCAGG ACTCTGGCCT 

15 GGAGGTGAAC ATTGTGACTG ACTCCCAGTA TGCCCTGGGC ATCATCCAGG CCCAGCCTGA 
TCA6TCTGAG TCTGAGCT6G T6AACCAGAT CATTGAGCAG CTGATCAAGA AGGAGAAGGT 
GTACCTGGCC TGGGTGCCTG CCCACAAGGG CATTGGGGGC AATGAGCAGG TGGACAA6CT 
GGTGTCTGCT GGCATCAGGA AGGTGCTGTT CCTGGATGGC ATTGACAAGG GCCAGGATGA 
GCATGAGAAG TACCACTCCA ACTGGAGGGC TATGGCCTCT GACTTCAACC TGCCCCCTGT 

20 G6TGGCTAAG GAGATTGTGG CCTCCTGTGA CAAGTGCCAG CTGAAGGGGG AGGCCATGCA 
TGGGCAGGTG GACTGCTCCC CTGGCATCTG GCAGCTGGAC TGCACCCACC TGGAGGGCAA 
GGTGATCCTG GTGGCTGTGC ATGTGGCCTC CGGCTACATT GAGGCTGAGG TGATCCCTGC 
TGAGACAGGC CAGGAGACTG CCTACTTCCT GCTGAAGCTG GCTGGCAGGT GGCCTGTGAA 
GACCATCCAC ACTGACAATG GCTCCAACTT CACTGGGGCC ACAGTGAGGG CTGCCTGCTG 

25 GTGGGCTGGC ATCAAGCAGG AGTTTGQCAT CCCCTACAAC CCCCAGTCCC AGGGGGTGGT 
GGAGTCCATG AACAAGGAGC TGAAGAAGAT CATTGGGCAG GTGAGGGACC AGGCTGAGCA 
CCTGAAGACA GCTGTGCAGA TGGCTGTGTT CATCCACAAC TTCAAGAGGA AGGGGGGCAT 
CGGGGGCTAC TCCGCTGGGG AGAGGATTGT GGACATCATT 6CCACAGACA TCCAGACCAA 
GGAGCTCCAG AAGCAGATCA CCAAGATCCA GAACTTCAGG GTGTACTACA GGGACTCCAG 

30 GAACCCCCTG TGGAAGGGCC CTGCCAAGCT GCTGTGGAAG GGGGAGGGGG CTGTGGTGAT 
CCAGGACAAC TCTGACATCA AGGTGGTGCC CAGGAGGAAG GCCAAGATCA TCAGGGACTA 
TGGCAAGCAG ATGGCTGGGG ATGACTGTGT GGCCTCCAGG CAGGATGAGG ACTAAAGCCC 
GGGCAGATCT (SEQ ID NO: 5). 

The open reading frame of the wild type tPA-pol construct disclosed as SEQ 
35 ID N0:5 contains 875 amino acids, disclosed herein as SEQ ID NO:6, as follows: 

Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val I*eu Leu Leu Cys Gly 
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Ala Val 


Phe 


Val 


Ser 


Pro 


Ser 




Glu Thr 


Val 


Pro 


Val 


Lys 


Leu 




Lys Gin 


Trp 


Pro 


Leu 


Thr 


Glu 




Cys Thr 


Glu 


Met 


Glu 


Lys 


Glu 


5 


Asn Pro 


Tyr 


Asn 


Thr 


Pro 


Val 




Lys Trp 


Arg 


Lys 


Leu 


Val 


Asp 




Asp Phe 


Trp 


Glu Val 


Gin 


Leu 




Lys Lys 


Lys 


Ser 


Val 


Thr 


Val 




Val Pro 


Leu 


Asp 


Glu 


Asp 


Phe 


10 


Ser He 


Asn 


Asn 


Glu 


Thr 


Pro 




Pro Gin 


Gly 


Trp 


Lys 


Gly Ser 




Lys He 


Leu 


Glu 


Pro 


Phe 


Arg 




Gin Tyr 


Met 


Asp 


Asp 


Leu Tyr 




His Arg 


Thr 


Lys 


XJ.e 


Glu 


Glu 


15 


Leu Thr 


Thr 


Pro 


Asp 


Lys 


Lys 




Met Gly 


Tyr 


Glu 


Leu 


His 


Pro 




Leu Pro 


Glu 


Lys 


Asp 


Ser Trp 




Gly Lys 


Leu 


Asn 


Trp 


Ala 


Ser 




Gin Leu 


Cys 


Lys 


Leu 


Leu Arg 


20 


Pro Leu 


Thr 


Glu 


Glu 


Ala 


Glu 




Leu Lys 


Glu 


Pro 


Val 


His 


Gly 




lie Ala 


Glu 


He 


Gin 


Lys 


Gin 




Tyr Gin 


Glu 


Pro 


Phe 


Lys Asn 




Arg Gly 


Ala 


His , 


Thr 


Asn Asp 


25 


Lys He 


Thr 


Thr 


Glu 


Ser 


He 




Lys Leu 


Pro 


lie 


Gin 


Lys 


Glu 




Trp Gin 


Ala 


Thr 


Trp 


He 


Pro 




Leu Val 


Lys 


Leu 


Trp 


Tyr 


Gin 




Glu Thr 


Phe 


Tyr 


Val 


Asp 


Gly 


30 


Lys Ala 


Gly 


Tyr Val 


Thr 


Asn 




Thr Asp 


Thr 


Thr 


Asn 


Gin 


Lys 




Leu Gin 


Asp 


Ser Gly 


Leu 


Glu 




Ala Leu 


Gly 


He 


He 


Gin 


Ala 




Val Asn 


Gin 


He 


He 


Glu 


Gin 


35 


Ala Trp 


Val 


Pro 


Ala 


His 


Lys 




Lys Leu 


Val 


Ser 


Ala 


Gly 


He 



Glu He Ser Ala Pro He Ser Pro He 
Lys Pro Gly Met Asp Gly Pro Lys Val 
Glu Lys lie Lys Ala Leu Val Glu He 
Gly Lys He Ser Lys He Gly Pro Glu 
Phe Ala He Lys Lys Lys Asp Ser Thr 
Phe Arg Glu Leu Asn Lys Arg Thr Gin 
Gly He Pro His Pro Ala Gly Leu Lys 
Leu Asp Val Gly Asp Ala Tyr Phe Ser 
Arg Lys Tyr Tlir Ala Phe Thr He Pro 
Gly He Arg Tyr Gin Tyr Asn Val Leu 
Pro Ala He Phe Gin Ser Ser Met Thr 
Lys Gin Asn Pro Asp He Val He lyr 
Val Gly Ser Asp Leu Glu He Gly Gin 
Leu Arg Gin His Leu Leu Arg Trp Gly 
His Gin Lys Glu Pro Pro Phe Leu Trp 
Asp Lys Trp Thr Val Gin Pro He Val 
Thr Val Asn Asp He Gin Lys Leu Val 
Gin He Tyr Pro Gly He Lys Val Arg 
Gly Thr Lys Ala Leu Thr Glu Val He 
Leu Glu Leu Ala Glu Asn Arg Glu He 
Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
Gly Gin Gly Gin Trp Thr Tyr Gin He 
Leu Lys Thr Gly Lys Tyr Ala Arg Met 
Val Lys Gin Leu Thr Glu Ala Val Gin 
Val lie Trp Gly Lys Thr Pro Lys Phe 
Thr Trp Glu Thr Tirp Trp Thr Glu Tyr 
Glu Trp Glu Phe Val Asn Thr Pro Pro 
Leu Glu Lys Glu Pro He Val Gly Ala 
Ala Ala Asn Arg Glu Thr Lys Leu Gly 
Arg Gly Arg Gin Lys Val Val Thr Leu 
Thr, Glu. Leu Gin Ala He Tyr Leu Ala 
Val Asn He Val Thr Asp Ser Gin Tyr 
Gin Pro Asp Gin Ser Glu Ser Glu Leu 
Leu He Lys Lys Glu Lys Val Tyr Leu 
Gly He Gly Gly Asn Glu Gin Val Asp 
Arg Lys Val Leu Phe Leu Asp Gly He 
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Asp Lys Ala Gin Asp 61u His Glu Lys Tyr His Ser Asn Trp Arg Ala 
Met Ala Ser Asp Phe Asn Leu Pro Pro Val Val Ala Lys Glu lie Val 
Ala Ser Cys Asp Lys Cys Gin Leu Lys Gly Glu Ala Met His Gly Gin 
Val Asp Cys Ser Pro Gly lie Trp Gin Ijcu Asp Cys Thr His Leu Glu 
5 Gly Lys Val lie Leu Val Ala Val His Val Ala Ser Gly Tyr lie Glu 
Ala Glu Val lie Pro Ala Glu Thr Gly Gin Glu Thr Ala Tyr Phe Leu 
Leu Lys Leu Ala Gly Arg Trp Pro Val Lys a*hr lie His Thr Asp Asn 
Gly Ser Asn Phe Thr Gly Ala Thr Val Arg Ala Ala Cys Trp Trp Ala 
Gly lie Lys Gin Glu Phe Gly lie Pro Tyr Asn Pro Gin Ser Gin Gly 

10 Val Val Glu Ser Met Asn Lys Glu Leu Lys Lys lie lie Gly Gin Val 
Arg Asp Gin Ala Glu His Leu Lys Thr Ala Val Gin Met Ala Val Phe 
lie His Asn Phe Lys Arg Lys Gly Gly lie Gly Gly Tyr Ser Ala Gly 
Glu Arg lie Val' Asp lie lie Ala Thr Asp lie Gin Thr Lys Glu Leu 
Gin Lys Gin lie Thr Lys lie Gin Asn Phe Arg Val Tyr Tyr Arg Asp 

15 Ser Arg Asn Pro Leu Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
Glu Gly Ala Val Val lie Glh Asp Asn Ser Asp lie Lys Val Val Pro 
Arg Arg Lys Ala Lys lie lie Arg Asp Tyr Gly Lys Gin Met Ala Gly 
Asp Asp Cys Val Ala Ser Arg Gin Asp Glu Asp (SEQ ID NO: 6) . 

The present invention also relates to a codon optimized EHV-l Pol mutant 

20 contained within a recombinant adenoviral vector such as lA-Pol (SEQ ID N0:4) 
which comprises a leader peptide at the amino terminal portion of the protein, which 
may effect ceUular trafficking and hence, immunogenicity of the expressed protein 
within the host celL Any such adenoviial-based HIV-1 DNA pol mutant disclosed in 
the above paragraphs is suitable for fusion downstream of a leader peptide, such as a 

25 leader peptide including but not limited to the human tPA leader sequence. Therefore, 
any such leader peptide-based HIV-1 pol mutant construct may include but is not 
limited to a mutated DNA molecule which effectively alters the catalytic activity of 
the RT, RNase and/or IN region of the expressed protein, resulting in at least 
substantially decreased enzymatic activity one or more of the RT, RNase H and/or IN 

30 functions of HIV-1 Pol. In a preferred embodiment of this portion of the invention, a 
leader peptide/EDTV-l DNA pol construct contains a mutation or mutations within the 
Pol coding region which effectively abolishes RT, RNase H and IN activity. An 
especially preferable HTV-1 DNA pol constmct is a DNA molecule which contains at 
least one point mutation which alters the active site and catalytic activity within the 

35 RT, RNase H and IN domains of Pol, such that each activity is at least substantially 
abolished, and preferably totally abolished. Such a HIV-1 Pol mutant will most likely 



68 



wo 02/22080 



PCT/USOl/28861 



comprise at least one point mutation in or around each catalytic domain responsible for 
RT, RNase H and IN activity, respectfully. An especially preferred embodiment of 
this portion of the invention relates to a human tPA leader fused to the lA-Pol protein 
comprising the nine mutations shown in Table 1. The DNA molecule is disclosed 
herein as SEQ ID N0:7 and the expressed tPA-IA Pol protein comprises a fusion 
junction as shown in Figure 18. The complete amino acid sequence of the expressed 
protein is set forth in SEQ ID NO:8. To this end, SEQ ID N0:7 discloses the 
nucleotide sequence which codes for a human tPA leader fused to the lA Pol protein 
comprising the nine mutations shown in Table 1 (herein, "tPA-opt-IApol"). The open 
reading frame begins with the initiating Met (nucleotides 8-10) and terminates with a 
"TAA" codon at nucleotides 2633-2635. The nucleotide sequence encoding tPA- 
lAPol is also disclosed as follows: 

GATCACCATG GATGCAATGA AGAGAGGGCT CTGCTGTGTG CTGCTGCTGT GTGGAGCAGT 
CTTCGTTTCG CCCAGCGAGA TCTCCGCCCC CATCTCCCCC ATTGAGACTG TGCCTGTGAA 
GCTGAAGCCT GGCATGGATG GCCCCAAGGT GAAGCAGTGG CCCCTGACTG AGGAGAAGAT 
CAAGGCCCTG GTGGAAATCT GCACTGAGAT GGAGAAGGAG GGCAAAATCT CCAAGATTGG 
CCCCGAGAAC CCCTACAACA CCCCTGTGTT TGCCATCAAG AAGAAGGACT CCACCAAGTG 
GAGGAAGCTG GTGGACTTCA GGGAGCTGAA CAAGAGGACC CAGGACTTCT GGGAGGTGCA 
GCTGGGCATC CCCCACCCXG CTGGCCTGAA GAA6AAGAAG TCTGTGACTG TGCTGGCTGT 
GGGGGATGCC TACTTCTCTG TGCCCCTGGA TGAGGACTTC AGGAAGTACA CTGCCTTCAC 
CATCCCCTCC ATCAACAATG AGACCCCTGG CATCAGGTAC CAGTACAATG TGCTGCCCCA 
GGGCTGGAAG GGCTCCCCTG CCATCTTCCA GTCCTCCATG ACCAAGATCC TGGAGCCCTT 
CAGGAAGCAG AACCCTGACA TTGTGATCTA CCAGTACATG GCTGCCCTGT ATGTGGGCTC 
TGACCTGGAG ATTGGGCAGC ACAGGACCAA GATTGAGGAG CTGAGGCAGC ACCTGCTGAG 
GTGGGGCCTG ACCACCCCTG ACAA6AAGCA GGAGAAGGAG CCCCCCTTCC TGTGGATGGG 
CTATGAGCTG CACCCCGACA AGTGGACTGT GCAGCCCATT GTGCTGCCTG AGAAGGACTC 
CTGGACTGTG AATGACATCC AGAAGCTGGT GGGCAAGCOXS AACTGGGCCT CCCAAATCTA 
CCCTGGCATC AAGGTGAGGC AGCTGTGCAA KTGCT6AGG GGCACCAAGG CCCTGACTGA 
GGTGATCCCC CT6ACTGAGG AGGCT6AGCT GGAGCTGGCT GAGAACA6GG AGATCCTGAA 
GGAGCCTGTG CATGGGGTGT ACTATGACCC CTCCAAGGAC CTGATTGCTG AGATCCAGAA 
-GCAQGGCCAG GGCCAGTGGA CCTACCAAAT CTACCAG6AG CCCTTCAAGA ACCTGAAGAC 
TGGCAAGTAT GCCAGGATGA GGGG66CCCA CACCAAT6AT GTGAAGCAGC TGACTGAGGC 
TGTGCAGAAG ATCACCACTG AGTCCATTGT GATCTGGGGC AAGACCCCCA AGTTCAAGCT 
GCCCATCCAG AAGGAGACCT GGGAGACCTG GTGGACTGAG TACTGGCAGG CCACCTGGAT 
CCCTGAGTGG GAGTTTGTGA ACACCCCCCC CCTGGTGAAG CTGTGGTACC AGCTGGAGAA 
GGAGCCCATT GTGGGGGCTG AGACCTTCTA TGTGGCTGGG GCTGCCAACA GGGAGACCAA 

• ■ 
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GCTGGGCAAG 


GCTGGCTATG 


T6ACCAACAG 


GGGCAGGCA6 


AAGGTGGTGA 


CCCTGACTGA 


CACCACCAAC 


CAGAA6ACTG 


CCCTCCAGGC 


CATCTACCTG 


GGCCTCCAGG 


ACTCTGGCCT 


GGAGGTGAAC 


ATTGTGACTG 


CCTCCCAGTA 


TGCCCTGGGC 


ATCATCCAGG 


CCCAGCCTGA 


TCAGTCTGAG 


TCTGAGCTGG 


TGAACCAGAT 


CATTGAGCAG 


CTGATCAAGA 


AGGAGAA6GT 


GTACCTGGCC 


TGGGTGCCTG" 


CCCACAAGG6 


CATTGGGGGC 


"AATGAGCAGG 


TG6ACAAGCT 


GGTGTCTGCT 


GGCATCAGGA 


AGGTGCTGTT 


CCTGGATGGC 


ATTGACAAGG 


CCCAGGATGA 


GCAT6AGAAG 


TACCACTCCA 


ACTGGAGGGC 


TATGGCCTCT 


GACTTCAACC 


TGCCCCCTGT 


GGTGGCTAAG 


GAGATTGTGG 


CCTCCTGTGA 


CAAGTGCCAG 


CTGAAGG6GG 


AGGCCATGCA 


TG6GCAGGTG 


GACTGCTCCC 


CTGGCATCTG 


GCAGCTGGCC 


TGCACCCACC 


TGGAGGGCAA 


GGTGATCCTG 


GTGGCTGTGC 


ATGTGGCCTC 


CG6CTACATT 


GAGGCTGAGG 


TGATCCCTGC 


TGAGACAGGC 


CAGGAGACTG 


CCTACTTCCT 


GCTGAAGCTG 


GCTGGCAGGT 


GGCCTGTGAA 


GACCATCCAC 


ACTGCCAATG 


GCTCCAACTT 


CACTGGGGCC 


ACAGTGAGGG. 


CTGCCTGCTG 


GTGGGCTGGC 


ATCAAGCAG6 


AGTTTGGCAT 


CCCCTACAAC 


CCCCAGTCCC 


AGGGGGTGGT 


GGCCTCCATG 


AACAAGGAGC 


TGAAGAAGAT 


CATTGGGCAG 


GTGAGGGACC 


AGGCTGAGCA 


CCTGAAGACA 


GCTGTGCAGA 


TGGCTGTGTT 


CATGCACAAC 


TTCAAGAGGA 


AGGGGGGCAT 


CGGGG6CTAC 


TCCGCTGGGG 


AGAGGATTGT 


GGACATCATT 


GCCACAGACA 


TCCAGACCAA 


GGAGCTCCA6 


AAGCAGATCA 


CCAA6ATCCA 


GAACTTCAG6 


GTGTACTACA 


GGGACTCCAG 


GAACCCCCTG 


TGGAAGGGCC 


CTGCCAAGCT 


GCTGTGGAAG 


GGGGAGGGGG 


CTGTGGTGAT 


CCAGGACAAC 


TCTGACATCA 


AGGTGGTGCC 


CAGGAGGAAG 


GCCAAGATCA 


TCAGGGACTA 


TGGCAAGCAG 


ATGGCTGGGG 


ATGACTGTGT 


GGCCTCCAGG 


CAGGATGAGG ACTAAAGCCC 


GGGCAGATCT 


(SEQ ID NO: 


7) . 









The open reading frame of the tPA-IA-pol construct disclosed as SEQ ID 



N0:7 contains 875 amino acids, disclosed herein as tPA-IA-Pol and SEQ ID NO:8, as 



follows: 



Met 


Asp 


Ala 


Met Lys 


Arg Gly 


Leu Cys 


Cys 


Val 


Leu 


Leu Leu Cys Gly 


Ala 


Val 


Phe 


Val Ser 


Pro 


Ser 


Glu He 


Ser 


Ala 


Pro 


He Ser Pro He 


Glu 


Thr 


Val 


Pro Val 


Lys 


Leu 


Lys Pro 


Gly 


Met 


Asp 


Gly Pro Lys Val 


Lys 


Gin 


Trp 


Pro Leu 


Thr 


Glu 


Glu Lys 


He 


Lys 


Ala 


Leu Val Glu He 


Cys 


Thr 


Glu 


Met Glu 


Lys 


Glu 


Gly Lys 


lie 


Ser 


Lys 


He Gly Pro Glu 


Asn 


Pro 


Tyr 


Asn Thr 


Pro 


Val 


Phe Ala 


He 


Lys 


Lys 


Lys Asp Ser Thr 


Lys 


Trp 


Arg 


Lys Leu 


Val 


Asp 


Phe Arg 


Glu 


Leu 


Asn 


Lys Arg Thr Gin 


Asp 


Phe 


Trp 


Glu Val 


Gin 


Leu 


Gly lie 


Pro 


His 


Pro 


Ala Gly Leu Lys 


Lys 


Lys 


Lys 


Ser Val 


Thr 


Val 


Leu Ala 


Val 


Gly 


Asp 


Ala Tyr Phe Ser 


Val 


Pro 


Leu 


Asp Glu 


Asp 


Phe 


Arg Lys 


Tyr 


Thr 


Ala 


Phe Thr He Pro 


Ser 


lie 


Asn 


Asn Glu 


Thr 


Pro 


Gly He 


Arg 


Tyr 


Gin 


Tyr Asn Val Leu 


Pro 


Gin 


Gly 


Tip Lys 


Gly 


Ser 


Pro Ala 


He 


Phe 


Gin 


Ser Ser Met Thr 
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Lys lie Leu 61u Pro Phe Arg 
Gin Tyr Met Ala Ala lieu Tyr 
His Arg Thr Lys lie Glu Glu 
Leu Thr Thr Pro Asp Lys Lys 
5 Met Gly Tyr Glu Leu His Pro 
Leu Pro Glu Lys Asp Ser Trp 
Gly Lys Leu Asn Trp Ala Ser 
Gin Leu Cys Lys Leu Leu Arg 
Pro Leu Thr Glu Glu Ala Glu 

10 Leu Lys Glu Pro Val His Gly 
lie Ala Glu lie Gin Lys Gin 
Tyr Gin Glu Pro Phe Lys Asn 
Arg Gly Ala His Thr Asn Asp 
Lys lie Thr Thr Glu Ser lie 

15 Lys Leu Pro lie Gin Lys Glu 
Trp Gin Ala Thr Trp lie Pro 
Leu Val Lys Leu Trp Tyr Gin 
Glu Thr Phe Tyr Val Ala Gly 
Lys Ala Gly Tyr Val Thr Asn 

20 Thr Asp Thr Thr Asn Gin Lys 
Leu Gin Asp Ser Gly Leu Glu 
Ala Leu Gly lie He Gin Ala 
Val Asn Gin He lie Glu Gin 
Ala Trp Val Pro Ala His Lys 

25 Lys Leu Val Ser Ala Gly He 
Asp Lys Ala Gin Asp Glu His 
Met Ala Ser Asp Phe Asn Leu 
Ala Ser Cys Asp Lys Cys Gin 
Val Asp Cys Ser Pro Gly He 

30 Gly Lys Val He Leu Val Ala 
Ala Glu Val He Pro Ala Glu 
Leu Lys Leu Ala Gly Arg Trp 
Gly Ser Asn Phe Thr Gly Ala 
Gly He Lys Gin Glu Phe Gly 

35 Val val Ala Ser Met Asn Lys 
Arg Asp Gin Ala Glu His Leu 



Lys Gin Asn Pro Asp He Val He Tyr 
Val Gly Ser Asp Leu Glu He Gly Gin 
Leu Arg Gin His Leu Leu Arg Trp Gly 
His Gin Lys Glu Pro Pro Phe Leu Trp 
Asp Lys Trp Thr. Val Gin Pro He Val 
Thr Val Asn Asp He Gin Lys Leu Val 
Gin He Tyr Pro Gly He Lys Val Arg 
Gly Thr Lys Ala Leu Thr Glu Val He 
Leu Glu Leu Ala Glu Asn Arg Glu He 
Val Tyr Tyr Asp Pro Ser Lys Asp Leu 
Gly Gin Gly Gin Trp Thr Tyr Gin He 
Leu Lys Thr Gly Lys Tyr Ala Arg Met 
Val Lys Gin Leu Thr Glu Ala Val Gin 
Val He Trp Gly Lys Thr Pro Lys Phe 
Thr Trp Glu Thr Trp Trp Thr Glu Tyr 
Glu Trp Glu Phe Val Asn Thr Pro Pro 
Leu Glu Lys Glu Pro He Val Gly Ala 
Ala Ala Asn Arg Glu Thr Lys Leu Gly 
Arg Gly Arg Gin Lys Val Val Thr Leu 
Thr Ala Leu Gin Ala He Tyr Leu Ala 
Val Asn He Val Thr Ala Ser Gin Tyr 
Gin Pro Asp Gin Ser Glu Ser Glu Leu 
Leu He Lys Lys Glu Lys Val Tyr Leu 
Gly He Gly Gly Asn Glu Gin Val Asp 
Arg Lys Val Leu Phe Leu Asp Gly He 
Glu Lys Tyr His Ser Asn Trp Arg Ala 
Pro Pro Val Val Ala Lys Glu He Val 
Leu Lys Gly Glu Ala Met His Gly Gin 
Trp Gin Leu Ala Cys Thr His Leu Glu 
Val His Val Ala Ser Gly Tyr He Glu 
Thr Gly Gin Glu Thr Ala Tyr Phe. Leu 
Pro Val Lys Thr He His Thr Ala Asn 
Thr Val Arg Ala Ala Cys Trp Trp Ala 
He Pro Tyr Asn Pro Gin Ser Gin Gly 
Glu Leu Lys Lys He He Gly Gin Val 
Lys Thr Ala Val Gin Met Ala Val Phe 
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lie His Asn Phe hys Arg Lys Gly Gly lie Gly Gly Tyr Ser Ala Gly 
GIu Arg lie Val Asp lie lie Ala Thr Asp lie Gin Thr Lys Glu Leu 
Gin Lys Gin lie Thr Lys He Gin Asn Phe Arg Val Tyr Tyr Arg Asp 
Ser Arg Asn Pro Leu Trp Lys Gly Pro Ala Lys Leu Leu Trp Lys Gly 
5 Glu Gly Ala Val Val He Gin Asp Asn Ser Asp He Lys Val Val Pro 
Arg Arg Lys Ala Lys He He Arg Asp Tyr Gly Lys Gin Met Ala Gly 
Asp Asp Cys Val Ala Ser Arg Gin Asp Glu Asp (SEQ ID NO: 8) . 

EXAMPDEIS 

10 CODON OPTIMIZED HIV-1 NEF AND CODON OPTIMIZED 

HIV-1 NEF MODIFICATIONS 
Codon optimized version of HIV-1 Nef and HIV-1 Nef modifications are 
essentially as described in U.S. Application Serial No. 09/738,782, filed 
December 15, 2000 and PCT International Application PCTAJSOO/34162, also filed 

15 December 15, 2000, both documents which are hereby incorporated by reference. As 
disclosed within the above-mentioned docmnents, particular embodiments of codon 
optimized Nef and Nef modifications relate to a DNA molecule encoding HTV-l Nef - 
from the HTV-l jfrl isolate wherein the codons are optimized for expression in a 
mammalian system such as a human. The DNA molecule which encodes this protein 

20 is disclosed herein as SEQ ID NO:9, while the expressed open reading firame is 
disclosed herein as SEQ ID NO: 10. Another embodiment of Nef-based coding 
regions for use in the adenoviral vectors of the present invention comprise a codon 
optimized DNA molecule encoding a protein containing the human plasminogen 
activator (tpa) leader peptide fused with the NH2-terminus of the HTV-l Nef 

25 polypeptide. The DNA molecule which encodes this protein is disclosed herein as 
SEQ ID NO: 11, while the expressed open reading frame is disclosed herein as SEQ 
ID NO: 12. Another modified Nef optimized coding region relates to a DNA molecule 
encoding optimized HIV-1 Nef wherein the open reading firame codes for 
modifications at the amino terminal mynstylation site (Gly-2 to Ala-2) and 

30 substitution of the Leu-174-Leu-175 dileucine motif to Ala-174-Ala-175, herein 

described as opt nef (G2A, LLAA). The DNA molecule which encodes this protein is 
disclosed herein as SEQ ID NO: 13, while the expressed open reading frame is 
disclosed herein as SEQ ID NO: 14. An additional embodiment relates to a DNA 
molecule encoding optimized HEV-l Nef wherein the amino terminal myristylation 

35 site and dileucine motif have been deleted, as well as comprising a tPA leader peptide. 
This DNA molecule, opt tpanef (LLAA), comprises an open reading frame which 

72 



wo 02/22080 



PCT/USOl/28861 



encodes a Nef protein containing a tPA leader sequence fused to amino acid residue 
6-216 of HIV-l Nef (jfrl), wherein Leu-174 and Leu-175 are substituted with Ala-174 
and Ala- 175, herein referred to as opt tpanef (LLAA) is disclosed herein as SEQ ID 
NO: IS, while the expressed open reading frame is disclosed herein as SEQ ID NO: 16. 
5 As disclosed in the above-identified documents (U.S. Application Serial No. 

09/738,782 and PCX International Application PCT/USOO/34162) and reiterated 
herein, the following nef-based nucleotide and amino acid sequences which comprise 
the respective open reading frame are as follows: 

1 . The nucleotide sequence of the codon optimized version of HIV-1 jrfl 
10 nef gene is disclosed herriri as SEQ ID NO:9, as shown herein: 

GATCTGCCAC CATGGGCGGC AAGTGGTCCA AGAGGTCCGT GCCCGGCTGG TCCACCGTGA 
GGGAGAGGAT GAGGAGG6CC GAGCCCGCCG CCGACAGGGT GAGGAGGACC GAGCCCGCCG 
CCGTGGGCGT GGGCGCCGTG TCCAGGGACC TGGAGAAGCA CGGCGCCATC ACCTCCTCCA 
ACACCGCCGC CACCAACGCC GACTGCGCCT GGCTGGAGGC CCAGGAGGAC GAGGAGGTGG 

15 GCTTCCCCGT GAGGCCCCAG GTGCCCCTGA GGCCCATGAC CTACAAGGGC GCCGTGGACC 
TGTCCCACTT CCTGAAGGAG AAGGGCGGCC TGGAGGGCCT GATCCACTCC CAGAAGAGGC 
AGGACATCCT GGACCTGTGG GTGTACCACA CXTCAGGGCTA CTTCCCCGAC TGGCAGAACT 
ACACCCCCGG CCCCGGCATC AGGTTCCCCC TGACCTTCGG CTGGTGCTTC AAGCTGGTGC 
CCGTGGAGCC CGAGAAGGTG GAGGAGGCCA AC6AGGGCGA GAACAACTGC CTGCTGCACC 

20 CCATGTCCCA GCACGGCATC GAGGACCCCG AGAAGGAGGT GCTGGAGTGG AGGTTCGACT 
CCAAGCTGGC CTTCCACCAC GTGGCCAGGG AGCTGCACCC CGAGTACTAC AAGGACTGCT 
AAAGCCCGGG C (SEQ ID N0:9) . 

Prefened codon usage is as follows: Met (ATG), Gly (GGC), Lys (AAG), 
Trp (TGG), Ser (TCC), Arg (AGG), Val (GTG), Pro (CCC), Thr (ACC), Glu (GAG); 

25 Leu (CTG),i3is (CAC), He (ATC), Asn (AAC), Cys (TGC), Ala (GCC), Gin (CAG), 
Phe (TTC) and Tyr (TAG). For an additional discussion relating to mammalian 
(human) codon optimization, see WO 97/31115 (PCTAJS97/02294), which is hereby 
incorporated by reference. See also Figure 19A-B for a comparion of wild type vs. 
codon optimized nucleotides comprising the open reading frame of HdV-Nef . 

30 The open reading frame for SEQ ID N0:9 above comprises an initiating 

methionine residue at nucleotides 12-14 and a *TAA" stop codon from nucleotides 
660-662. The open reading frame of SEQ ID N0:9 provides for a 216 amino acid 
mV-l Nef protein expressed through utilization of a codon optimized DNA vaccine 
vector. The 216 amino acid HIV-1 Nef (jfrl) protein is disclosed herein as SEQ ID 

35 NO: 10, and as follows: 

Met Gly Gly Lys Trp Ser Lys Arg Ser Val Pro Gly Trp Ser Thr Val 
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Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Arg Val Arg Arg 
Thr Glu Pro Ala Ala Val Gly Val Gly Ala Val Ser Arg Asp Leu Glu 
Lys His Gly Ala He Thr Ser Ser Asn Thr Ala Ala Thr Asn Ala Asp 
Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu Val Gly Phe Pro Val 
5 Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala Val Asp 
Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu He His 
Ser Gin Lys Arg Gin Asp He Leu Asp Leu Trp Val Tyr His Thr Gin 
Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly He Arg 
Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val Glu Pro 
10 Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn Asn Cys Leu Leu His 
Pro Met Ser Gin His Gly He Glu Asp Pro Glu Lys Glu Val Leu Glu 
Trp Arg Phe Asp Ser Lys Leu Ala Phe His His Val Ala Arg Glu Leu 
His Pro Glu Tyr Tyr Lys Asp Cys (SEQ ID NO: 10) . 

mV-l Nef is a 216 amino acid cytosolic protein which associates with the 

15 inner surface of the host cell plasma membrane through myristylation of Gly-2 

(Franchirii et al., 1986, Virology 155: 593-599). While not all possible Nef functions 
have been elucidated, it has become clear that correct trafficking of Nef to the inner . 
plasma membrane promotes viral replication by altering the host intracellular 
environment to facilitate the early phase of the HIV-l life cycle and by increasing die 

20 infectivity of progeny viral particles. In one aspect of the invention regarding 
codon-optimized, protein-modified polypeptides, the nef-encoding region of the 
adenovirus vector of the present invention is modified to contain a nucleotide 
sequence which encodes a heterologous leader peptide such that the anodno terminal 
region of the expressed protein will contain the leader peptide. The diversity of 

25 function that typifies eukaryotic cells depends upon the structural differentiation of 
their membrane boundaries. To generate and maintain these structures, proteins must 
be transported from their site of synthesis in the endoplasmic reticulum to 
predetermined destinations throughout the cell. This requires that the trafficking 
proteins display sorting signals that are recognized by the molecular inachinery 

30 responsible for route selection located at the access points to the main trafficking 
pathways. Sorting decisions for most proteins need to be made only once as they 
traverse their biosynthetic pathways since their final destination, the cellular location 
at which they perform their function, becomes their permanent residence. 
Maintenance of intracellular integrity depends in part on the selective sorting and 

35 accurate transport of proteins to their correct destinations. Defined sequence motifs 
exist in proteins which can act as 'address labels'. A number of sorting signals have 
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been found associated with the cytoplasmic domains of membrane proteins. An 
effective induction of CTL responses often required sustained, high level endogenous 
expression of an antigen. As membrane-association via myristylation is an essential 
requirement for most of Nefs function, mutants lacking myristylation, by glycine-to- 
5 alanine change, change of the dileucine motif and/or by substitution with a tpa leader 
sequence as described h^in, will be functionally defective, and therefore will have 
improved safety profile compared to wild-type Nef for use as an HIV-1 vaccine 
component 

In another embodiment of this portion of the invention, either the DNA vector 

10 or the HIV-1 nef nucleotide sequence is modified to include the human tissue-specific 
plasminogen activator (tPA) leader. As shown in Figure 16A-B, a DNA vector may 
be modified by known recombinant DNA methodology to contain a leader signal 
peptide of interest, such that downstream cloning of the modified HIV-1 protein of 
interest results in a nucleotide sequence which encodes a modified HIV-1 tPA/Nef 

15 protein. In the alternative, as noted above, insertion of a nucleotide sequence which 
encodes a leader peptide may be inserted into a DNA vector housing the open reading 
fiame for the Nef protein of interest Regardless of the cloning strategy, the end result 
is a polynucleotide vaccine which comprises vector components for effective gene 
expression in conjunction with nucleotide sequences which encode a modified HIV-l 

20 Nef protein of interest, including but not limited to a HIV- 1 Nef protein which 

contains a leader peptide. The amino acid sequence of the human tPA leader utilized 
herein is as follows: MDAMKRGLCCVLLLCGAVFVSPSEISS (SEQ ID NO: 17). 

It has been shown that myristylation of Gly-2 in conjunction with a dileucine 
motif in the carboxy region of the protein is essential for Nef-induced down 

25 regulation of CD4 (Aiken et al., 1994, Cell 16: 853-864) via endocytosis. It has also 
been shown that Nef expression promotes down regulation of MHCI (Schwartz et al., 
1996, Nature Medicine 2(3): 338-342) via endocytosis. The present invration relates 
in part to DNA vaccines which encode modified Nef proteins altered in trafficking 
and/or functional properties. The modifications introduced into the adenoviral vector 

30 HIV vaccines of the present invention include but are not limited to additions, 
deletions or substitutions to the nef open reading frame which results in the 
expression of a modified Nef protein which includes an amino terminal lead^ 
peptide, modification or deletion of the amino terminal myristylation site, and 
modification or deletion of the dileucine motif within the Nef protein and which alter 

35 function within tihe infected host cell. Th^fore, a central theme of tiie DNA 

molecules and recombinant adenoviral HIV vaccines of the present invention is (1) 



wo 02/22080 PCTAJSOl/28861 



host administration and intracellular delivery of a codon optimized nef-based 
adenoviral HIV vaccine; (2) expression of a modified Nef protein which is 
immunogenic in terms of eliciting both CTL and Th responses; and, (3) inhibiting or 
at least altering known early viral functions of Nef which have been shown to 
5 promote HIV-1 replication and load within an infected host. Therefore, the nef 

coding region may be altered, resulting in a DNA vaccine which expresses a modijSed 
Nef protein wherein the amino terminal Gly-2 myristylation residue is either deleted 
or modified to express alternate amino acid residues. Also, the nef coding region may 
be altered so as to result in a DNA vaccine which expresses a modified Nef protein 

10 wherein the dileucine motif is either deleted or modified to express alternate amino 
acid residues. In addition, the adenoviral vector HIV vaccines of the present 
invention also relate to an isolated DNA molecule, regardless of codon usage, which 
expresses a wild type or modified Nef protein as described herein, including but not 
limited to modified Nef proteins which comprise a deletion or substitution of Gly 2, a 

15 deletion or substitution of Leu 174 and Leu 175 and/or inclusion of a leader sequence. 
Therefore, specific Nef-based constructs further include the following, as 
exemplification 's and not limitations. For example, the present invention relates to an 
adenoviral vector vaccine which encodes modified forms of HTV-l, an open reading 
frame which encodes a Nef protein which comprises a tPA leader sequence fused to 

20 amino acid residue 6-216 of HIV-1 Nef (jfrl) is referred to herein as opt tpanef. The 
nucleotide sequence comprising the open reading frame of opt tpanef is disclosed 
herein as SEQ ID NO: 1 1 , as shown below: 





CATGGATGCA 


ATGAAGAGA6 


GGCTCTGCTG 


TGTGCTGCTG 


CTGTGTGGAG 


CAGTCTTCGT 




TTC6CCCAGC 


GAGATCTCCT 


CCAAGAGGTC 


CGTGCCCGGC 


TGGTCCACCG 


TGA6G6AGAG 


25 


GATGAGGAGG 


GCCGAGCCCG 


CCGCCGACAG 


GGTGAGGAG6 


ACCGAGCCCG 


CCGCCGTGGG 




CGTGGGCGCC 


GTGTCCAGGG 


ACCT6GA6AA 


GCACGGCGCC 


ATCACCTCCT 


CCAACACCGC 




CGCCACCAAC 


GCCGACTGC6 


CCTGGCTGGA 


GGCCCAGGAG 


GACGAGGAGG 


TGGGCTTCCC 




CGTGAGGCCC 


CAGGTGCCCC 


TGAQGCCCAT 


GACCTACAAG 


GGCGCCGTGG 


ACCTGTCCCA 




CTTCCTGAAG 


GAGAAGGGCG 


GCCTGGAGGG 


CCTGATCCAC 


TCCCA6AAGA 


GGCAGGACAT 


30 


CCTGGACCTG 


TGGGTGTACC 


ACACCCAGGG 


CTACTTCCCC 


GACTGGCAGA 


ACTACACCCC 




CGGCCCCGGC 


ATCAGGTTCC 


CCCTGACCTT 


CGGCTGGTGC 


TTCAAGCTGG- 


TGCCCGTGGA 




GCCC6AGAAG 


6TGGAGGAGG 


CCAACGA6GG 


CGAGAACAAC 


TGCCTGCTGC 


ACCCCATGTC 




CCAGCACGGC 


ATCGAGGACC 


CCGAGAAGGA 


GGTGCTGGAG 


TGGAGGTTCG 


ACTCCAAGCT 




GGCCTTCCAC 


CACGTGGCCA 


GGGA6CTGCA 


CCCCGAGTAC 


TACAAGGACT 


GCTAAAGCC 


35 


(SEQ IB NO: 


11). 











The open reading frame for SEQ ED NO: 11 comprises an initiating methionine 

- m 
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residue at nucleotides 2-4 and a *TAA" stop codon from nucleotides 713-715. The 
open reading frame of SEQ ID NO:3 provides for a 237 amino acid HIV-1 Nef 
piotein which comprises a tPA leader sequence fused to amino acids 6-216 of HIV-1 
Nef, including the dileucine motif at amino acid residues 174 and 175. This 237 
5 amino acid tPA/Nef (jfrl) fusion protein is disclosed herein as SEQ ID NO: 12, and is 





shown as follows: 






• 




Met 


Asp Ala Met Lys 


Arg 


Gly 


Leu Cys Cys Val Leu Leu Leu Cys Gly 




Ala 


Val Phe Val Ser 


Pro 


Ser 


Glu He Ser Ser Lys Arg Ser Val Pro 




Gly 


Trp Ser Thr Val 


Arg 


Glu 


Arg Met Arg Arg Ala Glu Pro Ala Ala 


10 


Asp 


Arg Val Arg Arg 


Thr 


Glu 


Pro Ala Ala Val Gly Val Gly Ala Val 




Ser 


Arg Asp Leu Glu 


Lys 


His 


Gly Ala He Thr Ser Ser Asn Thr Ala 




Ala 


Thr Asn Ala Asp 


Cys 


Ala 


Trp Leu Glu Ala Gin Glu Asp Glu Glu 




Val 


Gly Phe Pro Val 


Arg 


Pro 


Gin Val Pro Leu Arg Pro Met Thr Tyr 




Lys 


Gly Ala Val Asp 


Leu 


Ser 


His Phe Leu Lys Glu Lys Gly Gly Leu 


15 


Glu 


Gly Leu lie His 


Ser 


Gin 


Lys Arg Gin Asp He Leu Asp Leu Trp 




Val 


Tyr His Thr Gin 


Gly 


Tyr 


Phe Pro Asp Trp Gin Asn Tyr Thr Pro 




Gly 


Pro Gly He Arg 


Phe 


Pro 


Leu Thr Phe Gly Trp Cys Phe Lys Leu 




Val 


Pro Val Glu. Pro 


Glu 


Lys 


Val Glu Glu Ala Asn Glu Gly Glu Asn 




Asn 


Cys Leu Leu His 


Pro 


Met 


Ser Gin His Gly He Glu Asp Pro Glu 


20 


Lys 


Glu Val Leu Glu 


Trp 


Arg 


Phe Asp Ser Lys Leu Ala Phe His His 




Val 


Ala Arg Glu Leu 


His 


Pro 


Glu Tyr Tyr Lys Asp Cys (SEQ ID NO: 12) . 



Therefore, this exemplified Nef protein, Opt tPA-Nef , contains both a tPA leader 
sequence as well as deleting the myristylation site of Gly-2A DNA molecule encoding 
HIV-1 Nef from the HTV-l jfirl isolate wherein the codons are optimized for 

25 expression in a mammalian system such as a human. 

In another specific embodiment of the present invention, a DNA molecule is 
disclosed which encodes optimized HTV-l Nef wherein the open reading frame of a 
recombinant adenoviral HIV vaccine encodes for modifications at the amino tenninal 
myristylation site (Gly-2 to Ala-2) and substitution of the Leu-174-Leu-175 dileucine 

30 motif to Ala-174-Ala-175. This open reading frame is herein described as opt nef 
(G2AJ-LAA) and is disclosed as SEQ ID NO: 13, which comprises an initiating' 
methionine residue at nucleotides 12-14 and a 'TAA" stop codon fix)m nucleotides 
660-662. The nucleotide sequence of this codon optimized version of HIV-l jrfl nef 
gene with the above mentioned modifications is disclosed herein as SEQ ID NO: 13, 

35 as foUows: 
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GATCTGCCAC CATGGCCGGC AAGTGGTCCA AGAGGTCCGT GCCCGGCTGG TCCACCGTGA 
GGGAGAGGAT GAGGAGG6CC GAGCCCGCCG CCGACAGGGT GAGGAGGACC GAGCCCGCCG 
CCGTGGGCGT GGGCGCCGTG TCCAGGGACC TGGAGAAGCA CGGCGCCATC ACCTCCTCCA 
ACACCGCCGC CACCAACGCC GACTGCGCCT GGCTGGAGGC CCAGGAGGAC GAGGAGGTGG 
5 GCTTCCCCGT GAGGCCCCAG GTGCCCCTGA GGCCCATGAC CTACAAGGGC "GCCGTGGACC 
TGTCCCACTT CCTGAAGGAG AAGGGCGGCC TGGAGGGCCT GATCCACTCC CAGAAGAGGC 
AG6ACATCCT 6GACCTGTGG GTGTACCACA CCCAGGGCTA CTTCCCCGAC TGGCAGAACT 
ACACCCCCGG CCCCGGCATC AGGTTCCCCC TGACCTTCGG CTGGTGCTTC AAGCTGGTGC 
CCGTGGAGCC C6AGAAGGTG GAGGAGGCCA ACGAGGGCGA GAACAACTGC GCCGCCCACC 
10 CCATGTCCCA GCACGGCATC GAGGACCCCG AGAAGGAGGT GCTGGAGTGG AGGTTCGACT 
CCAAGCTGGC CTTCCACCAC GTGGCCAGGG AGCTGCACCC CGAGTACTAC AAGGACTGCT 
AAAGCCCGGG C (SEQ ID NO: 13) . 

The open reading frame of SEQ ID NO: 13 encodes Nef (G2A,LLAA), 
disclosed herein as SEQ ID NO: 14, as follows: 

15 Met Ala Gly Lys Trp Ser Lys Arg Ser Val Pro Gly Trp Ser 
Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Arg Val 
Thr Glu Pro Ala Ala Val Gly Val Gly Ala Val Ser Arg Asp 
Lys His Gly Ala lie Thr Ser Ser Asn Thr Ala Ala Thr Asn 
Cys Ala Trp Leu Glu Ala Gin Glu Asp Glu Glu Val Gly Phe 

20 Arg Pro Gin Val Pro Leu Arg Pro Met Thr Tyr Lys Gly Ala 
Leu Ser His Phe Leu Lys Glu Lys Gly Gly Leu Glu Gly Leu 
Ser Gin Lys Arg Gin Asp lie Leu Asp Leu Trp Val Tyr His 
Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr Pro Gly Pro Gly 
Phe Pro Leu Thr Phe Gly Trp Cys Phe Lys Leu Val Pro Val 

25 Glu Lys Val Glu Glu Ala Asn Glu Gly Glu Asn Asn Cys Ala 
Pro Met Ser Gin His Gly lie Glu Asp Pro Glu Lys Glu Val 
Trp Arg Phe Asp Ser Lys Leu Ala Phe His His Val Ala Arg 
His Pro Glu Tyr Tyr Lys Asp Cys Ser (SEQ ID NO: 14) . 

An additional embodiment of the present invention relates to another DNA 
30 molecule encoding optimized HTV-l Nef wherein the amino terminal myristylation 
site and dileucine motif have been deleted, as well as comprising a tPA leader peptide. 
This DNA molecule, opt tpanef CULAA) comprises an open reading frame which 
encodes a Nef prptein containing a tPA leader sequence fused to amino acid residue 
6-216 of HIV-1 Nef (jfrl), wherein Leu-174 and Leu-175 are substituted with Ala-174 
35 and Ala-175 (Ala-195 and Ala-196 in this tPA-based fusion protein). The nucleotide 



Thr Val 
Arg Arg 
Leu Glu 
Ala Asp 
Pro Val 
Val Asp 
He His 
Thr Gin 
He Arg 
Glu Pro 
Ala His 
Leu Glu 
Glu Leu 
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sequence comprising the open reading frame of opt tpanef (LLAA) is disclosed h^in 
as SEQ ID NO: 15, as shown below: 

CATGGATGCA ATGAAGAGAG GGCTCTGCTG TGTGCTGCTG CTGTGTGGAG CAGTCTTCGT 
TTCGCCCAGC GAGATCTCCT CCAAGAGGTC CGTGCCCG6C TGGTCCACCG TGAGGGAGAG 
5 GATGAGGAGG GCCGAGCCCG CCGCCGACAG GGTGAGGAGG ACCGAGCCCG CCGCCGTGGG 
CGTGGGCGCC GTGTCCAGGG ACCTGGAGAA GCACGGCGCC ATCACCTCCT CCAACACCGC 
CGCCACCAAC GCCGACTGCG CX:TGGCTGGA GGCCCAGGAG GACGAGGAGG TGGGCTTCCC 
CGTGAGGCCC CAGGTGCCCC T6AGGCCCAT GACCTACAAG G6CGCCGTGG ACCTGTCCCA 

cttcctgaag gagaagggcg gcctggaggg cctgatccac tcccagaaga ggcaggacat 

10 CCTGGACCTG TGGGTGTACC ACACCCAGG6 CTACTTCCCC GACTGGCAGA ACTACACCCC 
CGGCCCCGGC ATCAGGTTCC CCCTGACCTT CGGCTGGTGC TTCAAGCTGG TGCCCGTGGA 
GCCCGAGAAG GTGGAGGAGG CCAACGAGGG CGAGAACAAC TGCGCCGCCC ACCCCATGTC 
CCAGCACGGC ATCGAGGACC CCGAGAAGGA GGTGCTGGA6 TGGAGGTTCG ACTCCAAGCT 
GGCCTTCCAC CACGTGGCCA GGGAGCTGCA CCCCGAGTAC TACAAGGACT GCTAAAGCCC 

15 (SEQ ID NO: 15) . 

The open reading frame of SEQ ID N0:7 encoding tPA-Nef (LLAA), 
disclosed herein as SEQ ID NO: 16, is as follows: 





Met 


Asp 


Ala 


Met 


Lys 


Arg 


Gly 


Leu 


Cys 


Cys 


Val 


Leu 


Leu 


Leu 


Cys 


Gly 




Ala 


Val 


Phe 


Val 


Ser 


Pro 


Ser 


Glu 


He 


Ser 


Ser 


Lys 


Arg 


Ser 


Val 


Pro 


20 


Gly 


Trp 


Ser 


Thr 


Val 


Arg 


Glu 


Arg 


Met 


Arg 


Arg 


Ala 


Glu 


Pro 


Ala 


Ala 




Asp 


Arg 


Val 


Arg 


Arg 


Thr 


Glu 


Pro 


Ala 


Ala 


Val 


Gly 


Val 


Gly 


Ala 


Val 




Ser 


Arg 


Asp 


L»eu 


Glu 


Lys 


His 


Gly 


Ala 


He 


Thr 


Ser 


Ser 


Asn 


Thr 


Ala 




Ala 


Thr 


Asn 


Ala 


Asp 


Cys 


Ala 


Trp 


Leu 


Glu 


Ala 


Gin 


Glu 


Asp 


Glu 


Glu 




Val 


Gly 


Phe 


Pro 


Val 


Arg 


Pro 


Gin 


Val 


Pro 


Leu 


Arg 


Pro 


Met 


Thr 


Tyr 


25 


Lys 


Gly 


Ala 


Val 


Asp 


Leu 


Ser 


His 


Phe 


Leu 


Lys 


Glu 


Lys 


Gly 


Gly 


Leu 




Glu 


Gly 


Leu 


lie 


His 


Ser 


Gin 


Lys 


Arg 


Gin 


Asp 


He 


Leu 


Asp 


Leu 


Trp 




Val 


Tyr 


His 


Thr 


Gin 


Gly 


Tyr 


Phe 


Pro 


Asp 


Trp 


Gin 


Asn 


Tyr 


Thr 


Pro 




Gly 


Pro 


Gly 


lie 


Arg 


Phe 


Pro 


Leu 


Thr 


Phe 


Gly 


Trp 


Cys 


Phe 


Lys 


Leu 




Val 


Pro 


Val 


Glu 


Pro 


Glu 


Lys 


Val 


Glu 


Glu 


Ala 


Asn 


Glu 


Gly 


Glu. 


Asn 


30 


Asn 


Cys 


Ala 


Ala 


His 


Pro 


Met 


Ser 


Gin 


His 


Gly 


He 


Glu 


Asp 


Pro 


Glu 




Lys 


Glu 


Val 


Leu 


Glu 


Trp 


Arg 


Phe 


Asp 


Ser 


Lys 


Leu 


Ala 


Phe 


His 


His 




val 


Ala 


Arg 


Glu 


Leu 


His 


Pro 


Glu 


Tyr 


Tyr 


Lys 


Asp 


Cys 


(SEQ ID 


N0:16) . 



An adenoviral vector of the present invention may comprise a DNA sequence, 
regardless of codon usage, which expresses a wild type or modified Nef protein as 
35 described herein, including but not limited to modified Nef proteins which comprise a 
deletion or substitution of Gly 2, a deletion of substitution of Leu 174 and Leu 175 
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and/or inclusion of a leader sequence. Therefore, partial or fully codon optiixdzed 
DNA vaccine expression vector constructs are preferred since such constructs should 
result in increased host expression. However, it is within the scope of the present 
invention to utilize "non-K;odon optimized" versions of the constructs disclosed herein, 
5 especially modified versions of HIV Nef which are shown to promote a substantial 
ceUular inmiune response subsequent to host administration. 

Figure 20A-C show nucleotide sequences at junctions between nef coding 
sequence and plasmid backbone of nef expression vectors VlJns/nef (Figure 20A), 
VlJns/nef(G2A,LlAA) (Figure 20B), VI Jns/tpanef (Figure 20C) and 

10 VI Jns/tpanef (LLAA) (Figure 20C, also). 5' and 3' flanking sequences of codon 

optimized nef or codon optimized nef mutant genes are indicated by bold/italic letters; 
nef and nef mutant coding sequences are indicated by plain letters. Also indicated (as 
underlined) are the restriction endonuclease sites involved in construction of 
respective nef expression vectors. VlJns/tpanef and VlJns/tpanef(LLAA) have 

15 identical sequences at the junctions. 

Figure 21 shows a schematic presentation of nef and nef derivatives. Amino 
acid residues involved in Nef derivatives are presented. Glycine 2 and Leucinel74 
and 175 are the sites involved in myristylation and dileucine motif, respectively. 

20 EXAMPLE 19 

MRKAdSPol Construction and Virus Rescue 
Construction of vector: slmttle plasmid and pre-adenovirus plasmid - Key 
steps performed in the construction of the vectors, including the pre-adenovirus 
plasmid denoted MRKAd5p6l, is depicted in Hgure 22. Briefly, the adenovkal shuttle 

25 vector for the full-length inactivated HTV-l pol gene is as follows. The vector 

MRKpdelEl(Pac/pIX/pack450)H<MVmin+BGHpA(str.) is a derivative of the shutfle 
vector used in the construction of the MRKAd5gag adenoviral pre-plasmid The 
vector contains an expression cassette with the hCMV promoter (no intronA) and the 
bovine growth hormone polyadenylation signal. The expression unit has been 

30 inserted into the shuttle vector such that insertion of the gene of choice at a unique 
BglR site will ensure the direction of transcription of the transgene will be Ad5 El 
parallel when inserted into the MRKpAd5(El -yE3+)Clal (or MRKpAdHVES) pre- 
plasmid. The vector, similar to the original shuttle vector contains the Pacl site, 
extension to the packaging signal region, and extension to the pIX gene. The synthetic 

35 full-length codon-optimized HTV-l pol gene was isolated directly ficom the plasmid 
pVlJns-HIV-pol-inact(opt). Digestion of this plasmid with Bgl II releases the pol 
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gene intact (comprising a codon optimized lA pol sequence as disclosed in SEQ ID 
NO:3). Ilie pol fragment was gel purified and ligated into the 
MRKpdelEl(Pac/pIX/pack450)4<MVmin+BGHpA(st^ shuttle vector at the BglSL 
site. The clones were checked for the correct orientation of the gene by using 
5 restriction enzymes DrdSUNoth A positive clone was isolated and named 

MRKpdel+hCMVmin+FL-pol+bGHpA(s). The genetic structure of this plasmid was 
verified by PGR, restriction enzyme and DNA sequencing. The pre-adenovirus 
plasmid was constructed as follows. Shutde plasmid MRKpdel+hCMVnoin+EL- 
pol+bGHpA(S) was digested with restriction enzymes Pad and BstllQl I (or its 

10 isoschizomer, BstZlOl I) and then co-transformed into £ coli strain BJ5183 with 
linearized {Clal digested) adenoviral backbone plasmid, MRKpAdCEl-yE3+)Clal . 
The resulting pre-plasmid originally named MRKpAd+hCMVmin+HL- 
pol+bGI^A(S)E3+ is now referred to as "pMRKAdSpol". The genetic stracture of 
the resulting pMRKAdSpol was verified by PGR, restriction enzyme and DNA 

15 sequence analysis. The vectors were transformed into competent E. coli XL-1 Blue 
for preparative production. The recovered plasmid was verified by restriction enzyme 
digestion and DNA sequence analysis, and by expression of the pol transgene in 
transient transfection cell culture. Tbe complete nucleotide sequence of this 
pMRKAdSmV-lpcl adenoviral vector is shown in Figure 26 A-AO. 

20 Generation of research-grade recombinant adenovirus - The pre-adenovirus 

plasmid, pMRKAdSpol, was rescued as infectious virions in PER.G6® adherent 
monolayer cell culture. To rescue infectious virus, 12 |Ltg of pMRKAdSpol was 
digested with restriction enzyme PacI (New England Biolabs) and 3.3 \ig was 
transfected per 6 cm dish of PER.G6® cells using the calcium phosphate co- 

25 precipitation technique (Cell Phect Transfection Kit, Amersham Pharmacia Biotech 
Inc.). PacI digestion releases the viral genome from plasmid sequences allowing viral 
replication to occur after entry into PER.C6®cells. Infected cells and media were 
harvested 6 -10 days post-transfection, after complete viral cytopathic effect (CPE) 
was observed. Infected ceUs and media were stored at ^ -60**G. This pol containing 

30 recombinant adenovims is referred to herein as "MRKAdSpol". This recombinant 
adenovirus expresses an inactivated HTV-l Pol protein as shown in SEQ ID N0:6. . 

EXAMPLE20 
MRKAdSNef Construction and Virus Rescue 
35 Construction of vector: shuttle plasmid and pre-adenovirus plasmid - Key 

steps performed in the constraction of the vectors, including the pre-adenovirus 
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plasmid denoted MRKAdSnef, is depicted in Figure 23. Bnefly, as shown in 
Example 19 above, the vector 

MIlKpdeffil(Pac/pIX/pack450)4<3MVmm is tfie shuttle vector used in 

the construction of the MRKAdSgag adenoviral pre-plasmid. It has been modified to 
5 contain the Pacl site, extension to the packaging signal region, and extension to the 
pDC gene. It contains an expression cassette with the hCMV promoter (no intronA) 
and the bovine growth hormone polyadenylation signal. The expression unit has been 
inserted into the shuttle vector such that insertion of the gene of choice at a unique 
Bgll 1 site will ensure the direction of transcription of the transgene will be Ad5 El 

10 parallel when inserted into the MRKpAd5(El-/E3+)Clal pre-plasmid. The synthetic 
full-length codon-optimized HIV-1 nef gene was isolated directly from the plasmid 
pVlJns/nef (G2A,LLAA). Digestion of this plasmid with Bglll releases the pol gene 
intact, which comprises the nucleotide sequence as disclosed in SEQ ID NO: 13. The 
nef fragment was gel purified and ligated into the 

15 MRKpdelEl+CMVmin+BGHpA(str.) shutde vector at the Bgll 1 site. The clones 
were checked for correction orientation of the gene by using restriction enzymt Seal. 
A positive clone was isolated and named MRKpdelElhCMVmiiiFI^nefBGHpA(s). - - 
The genetic structure of this plasmid was verified by PGR, restriction enzyme and 
DNA sequencing. The pre-adenovirus plasmid was constructed as follows. Shuttle 

20 plasmid MRKpdelElhCMVininFL-nefBGHpA(s) was digested with restriction 

enzymes Pacl and BstllOl I (or its isoschizomer, BstZlOl T) and then co-transformed 
into E, coli strain BJ5183 with linearized {Clal digested) adenoviral backbone 
plasmid, MIlKpAd(El/E3+)Clal. The resulting pre-plasmid originally named 
MRKpdelElhCMVminEL-nefBGHpA(s) is now referred to as "pMRKAdSnef The 

25 genetic structure of the resulting pMRKA.d5nef was verified by PGR, restriction 

enzyme and DNA sequence analysis. The vectors were transformed into competent E. 
coli XL-1 Blue for preparative production. The recovered plasmid was verified by 
restriction enzyme digestion and DNA sequence analysis, and by expression of the 
nef transgene in transient transfection cell culture. The complete nucleotide sequence 

30 of this pMRKAd5HIV-lnef adenoviral vector is shown in Figure 27A-AM. 

Generation of research-grade recombinant adenovirus - The pre-adenovirus 
plasmid, pMRKAdSnef, was rescued as infectious virions in PER.C6® adherent 
monolayer cell culture. To rescue infectious virus, 12 jxg of pMKKAdnef was 
digested with restriction enzyme Pacl (New England Biolabs) and 3.3 jig was 

35 transfected per 6 cm dish of PER.C6® cells using the calcium phosphate co- 
precipitation technique (Cell Phect Transfection Kit, Amersham Pharmacia Biotech 
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Inc.). Pacl digestion releases the viral genome from plasmid sequences allowing 
viral replication to occur after entry into PER.C6®cells. Infected cells and media were 
harvested 6-10 days post-transfection, after complete viral cytopathic effect (CPE) 
was observed. Infected cells and media were stored at < -60°C. This nef containing 
5 recombinant adenovirus is now referred to as "MRKAdSnef \ 

EXAMPLE 21 

Construction of Murine CMV Promoter Containing Shutde Vectors 

for Inactivated Pol and Nefi^G2Aj:-LAA 

10 The murine CMV (mCMV) was amplified fix)m the plasmid pMH4 (supplied 

by Frank Graham, McMaster University) using the primer set: mCMV (Not T) 
Forward: 5'-ATA AGA ATGCGGCCGCCA TAT ACT GAG TCA TTA GG-3^ 
(SEQ ID NO: 20); mCMV (Bgl n)Reverse: 5'-AAG G AAGATCT A CCG ACG 
CTG GTC GCG CCT C-3' (SEQ ID NO:21). The underlined nucleotides represent 

15 the Not I and the Bgl II sites respectively for each primer. This PGR amplicon was 
used for the construction of the mCMV shuttle vector containing the transgene in the 
El parallel orientation. The hCMV promoter was removed from the original shuttle 
vector (containing the hCMV-gag-bGHpA transgene in the El parallel orientation) by 
digestion with Not I and Bgl IL The mCMV promoter (Not VBgl U digested PGR 

20 product) was inserted into the shuttle vector in a directional manner. The shuttle 
vector was then digested with Bgl II and the gag reporter gene (Bgl H fragment) was 
re-inserted back into the shutde vector. Several clones were screened for correct 
orientation of the reporter gene. For the construction of the mCMV-gag in the El 
antiparallel orientation, the mCMV promoter was amplified from the plasmid pMH4 

25 using the following primer set: mCMV (Asc I) Forward: 5'- ATA AGA ATGGCG 
CGCCA T ATA CTG AGT CAT TAG G (SEQ ID NO:22); mCMV (JSgl H) Reverse: 
5' AAG GAAGATCTA CCG ACG CTG GTC GCG CCT C (SEQ ID NO:23). The 
underlined nucleotides represent the Asc I and Bgl U sites, respectively for each 
primer. The shutde vector containing the hCMV-gag transgene in the El antiparallel 

30 orientation was digested with A^cl and Bgll 1 to remove the hCMV-gag portion of the 
transgene. The mCMV promoter {Ascl/BglU digested PCR product) was inserted 
into the shutde vector in a directional manner. The vector was tiien digested with 
BglU and the gag reporter gene (Bglll fi^igment) was re-inserted. Several clones 
were screened for correct orientation of the reporter gene. For each of the full length 

35 lA pol and full length ne^G2A,IiAA genes, cloning was performed using die unique 
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Bgl n site within the mCMV-bGHpA shuttle vector. The pol and nef genes were 
excised from their respective pVl Jns plasmids by Bgl U digestion. 

EXAMPLE22 

5 Construction of mGMV Full Length Inactivated Pol and 

Full Lengfli ne^G2AXLAA Adenovectors 
Each of these transgenes of Example 21 were inserted into the modified 
shuttle vector in both the El parallel and El anti-parallel orientations. Pacl and 
BsiZllOl digestion of each shuttle vector was performed and each specific transgene 
10 fragment containing the flanking Ad5 sequences was isolated and co-transformed 
with Cla 1 digested MRKpAd5(E3+) or MRKpAd5(E3-) adeiiovector plasmids via 
bacterial homologous recombination in BJ5183 E. coli cells. Recombinant pre- 
plasmid adenovectors containing the various transgenes in both the E3- and E3+ 
versions (and in the El parallel and El antiparallel orientations) were subsequently 
15 prepared in large scale following transformation into XL-1 Blue £1 coli cells and 
analyzed by restriction analysis and sequencing. 

EXAMPI£23 
Construction of hCMV-tpa-nef (LLAA) Adenovector 

20 The tpa-nef gene was amplified out from GMP grade pVlJns-tpanef (LLAA) vector 
using the primer sets: Tpanef CBamHI) F 5'-ATT GGATCC ATG GAT OCA ATG 
AAG AGA GGG rSEO ID 24V Tpanef (BarnHL R 5^-ATA GGA TCC TTA GCA 
GTC CTT GTA GTA CTC G (SEQ ID NO:25). The resulting PGR product was 
digested with BattiESL, gel purified and cloned into the Bgl n site of MRKAdSCMV- 

25 bGfi^A shutde vector (JBgl II digested and calf intestinal phosphatase treated). 

Clones containing the tpanef (LLAA) gene (see SEQ ID NO: 15 for complet coding 
region) in the correct orientation with respect to the hCMV promoter were selected 
following Sea I digestion. The resulting MRKAd5tpanef shuttle vector was digested 
with Pac I and Bst Zl 101 and cloned into the E3+ MRKAdS adenovector via bacterial 

30 homologous recombination techniques. 

EXAMPLE 24 

Immunogenicity of MRKAdSpol and MRKAdSnef Vacdne 
Materials and Methods - Rodent Immunization - Groups of N=10 BALB/c 
35 mice were immunized i.m. with the following vectors: (1) MKKLAdShCMV-IApol 
(E3+) at either 10^7 vp and 10^9 vp; and (2) MRKAdShCMV-IApol 0B3-) at either 
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10^7 vp and 10^9 vp. At 7 weeks post dose, 5 of the 10 mice per cohort were boosted 
with the same vector and dose they initially received At 3 weeks post the second 
does, sera and spleens were collected firom all the animals for RT ELIS A and IFNg 
EUspot analyses, respectively. For all rodent immunizations, the Ad5 vectors were 
5 diluted in 5 mM Tris, 5% sucrose, 75 mM NaCl, 1 mM MgC12, 0.005% polysorbate 
80, pH 8.0. The total dose was injected to both quadricep muscles in 50 fjL aliquots 
using a 0.3-mL insulin syringe with 28-1/2G needles (Becton-Dickinson, Franklin 
Lakes, NJ). 

Groups of N=10 C57/BL6 mice were immunized i.m. with the following 
10 vectors: (1) MRKAd5hCMV-nef(G2AJULAA) (E3+) at either 10^7 vp and 10^9 vp; 
(2) MRKAd5mCMV-nef(G2AJJLAA) (E3+) at either 10^7 vp and 10^9 vp; and (3) 
MRKAdSmCMV-tpanefCIlAA) (E3+) at either 10^7 vp and 10^9 vp. At 7 weeks . 
post dose, 5 of the 10 mice per cohort were boosted with the same vector and dose 
they initially received. At 3 weeks post the second does, sera and spleens were 
15 collected from all the animals for RT ELIS A and BFNg ELIspot analyses, 
respectively. 

Non-hunian Primate immunization - Cohorts of 3 rhesus macaques (2-3 kg) . 
were vaccinated with the following Ad vectors: (1) MRKAd5hCMV-IApol (E3+) at 
either 10'^9 vp and lO'^ll vp dose; and (2) MRKAdShCMV-IApol (E3-) at eiflier 

20 10^9 vp and 10^11 vp; (3) MRKAd5hCMV-nef(G2AJLLAA) (E3+) at either 10^9 vp 
and 10^11 vp; and (4) MRKAd5mCMV-nef(G2AJLLAA) (E3+) at either 10'^9 vp and 
10^1 1 vp. The vaccine was administered to chemically restrained monkeys (10 
mg/kg ketamine) by needle injection of two 0.5 mL aliquots of the Ad vectors (in 5 
mM Tris. 5% sucrose, 75 mM NaCl, 1 mM MgCU, 0.005% polysorbate 80, pH 8.0) 

25 into both deltoid muscles. The animals were immunized twice at a 4 week interval 
(T=0, 4 weeks). 

Murine anti-RT and onti-nefEUSA - Anti-RT titers were obtained following 
standard secondary antibody-based ELES A. Maxisorp plates (NUNC, Rochester; NY) 
were coated by overnight incubation with 100 {xL of 1 |xg /mL HIV-1 RT protein 

30 (Advanced Biotechnologies, Columbia, MD) in PBS. For anti-nef ELIS A, 100 uL of 1 
ug/mL HLV-l nef (Advanced Biotechnologies, Columbia, MD) was used to coat the 
plates. The plates were washed. with PBS/0.05% Tween 20 using Titertek MAP 
instrument (HunstviUe, AL) and incubated for 2 h with 200 pL/well of blocking 
solution (FBS/0.05% tween/1% BSA). An initial serum dilution of 100-fold was 

35 performed followed by 4-fold serial dilution. lOO-^L aliquots of serially diluted 
samples were added per weU and incubated for 2 h at room temperature. The plates 
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were washed and 100 )xL of 1/1000-diluted HElP-rabbit anti-mouse IgG (ZYMED, 
San Francisco, CA) were added with 1 h incubation. The plates were washed 
thoroughly and soaked with 100 jJiL 1,2-phenylenediaimne dihydrochloiide/hydrogen 
peroxide (DAKO, Norway) solution for IS ndn. The reaction was quenched by 
5 adding 100 pL of 0.5M H2SG4 per well. - OD492 readings were recorded using Titertek 
Multiskan MCC/340 with S20 stacker. Endpoint titers w^ defined as the highest 
serum dilution that resulted in an absorbance value of greater than or equal to 0.1 
OD492 (2.5 times the background value). 

Non-human primate and murine EUspot assays - The enzyme-linked 

10 immuno-spot OBLISpot) assay was utilized to enumerate antigen-specific INf^- 
secreting cells from mouse spleens (Miyahira, et al.l995, /. Immunol Methods 
181:45-54) or macaque PBMCs. Mouse spleens were pooled from 5 mice/cohort and 
single cell suspensions were prepared at 5xlO^/mL in complete RPME media 
(RPMI1640, 10% FBS, 2mM l^glutamine, lOOU/mL Penicillin, 100 u/mL 

15 streptomycin, 10 mM Hepes, 50 uM P-ME). Rhesus PBMCs were prepared from 8- 
15 mL of hepaiinized blood following standard Hcoli gradient separation (Coligan, et 
al, 1998, Current Protocols in Immunology. John Wiley & Sons, Inc.). Multiscreen 
opaque plates (Millipore, France) were coated with 100 piUwell of either 5 \ig/mL 
purified rat anti-mouse IFN-y IgGl, clone R4-6A2 ffharmingen, San Diego, CA), or 

20 15 ug/mL mouse anti-human IFN-y IgGjia (Cat. No. 1598-00, R&D Systems, 
Minneapolis. MN) in PBS at 4°C overnight for murine or monkey assays, 
respectively. The plates were washed with PBS/penicillin/streptomycin and blocked 
with 200 plVwell of complete RPMI media for 37 °C for at least 2 h. 

To each well, 50 \iL of cell samples (4-5x10^ cells per well) and 50 ML of the 

25 antigen solution were added. To the control well, 50 jiL of the media containing 

DMSO were added; for specific responses, either selected peptides or peptide pools (4 
ug/mL per peptide final concentration) were added. For B ALB/c mice immunized . 
with the pol constructs, stimulation was conducted using a pool of CD4'^-epitope 
containing 20-mer peptides (aa21-40, aa41 1-430, aa641-660, aa73 1-750, aa771-790) 

30 or a pool of CDS'^^-epitope containing peptides (aa201-220, aa311-330, aa781-800). 
For C57/BL6 nrice immunized with the nef construct, either aa51-70 (CDS"*" T cell 
epitope) or aa81-100 (CD4^ peptide derived from the nef sequence was added for 
specific stimulation. In monkeys, the responses against pol were evaluated using two 
pools (L and R) of 20-aa peptides that encompass the entire pol sequence and overlap 

35 by 10 amino acids. In monkeys vaccinated with the nef constructs, a single pool 
containing 20-mer peptides covering the entire HIV-1 nef sequence and overlapping 
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by 10 aa was used. Each sample/antigen mixture was performed in triplicate wells for 
murine samples or in duplicate wells for riiesus PBMCs. Plates were incubated at 
37°C, 5% CO2, 90% humidity for 20-24 h. The plates were washed with PBS/0.05% 
Tween 20 and incubated with 100 jiL/well of either 1.25 jxg/mL Wotm-conjugated rat 
5 anti-mouse IFN-y mAb, clone XMGl .2 (Pharmingen) or of 0. 1 ug/mL biotinylated 
anti-human lEN-gamma goat polyclonal antibody (R&D Systems) at 4°C overnight. 
The plates were washed and incubated with 100 nlVweU 1/2500 dilution of 
stiepavidin-alkaline phosphatase conjugate (Pharmingen) m PBS/0.005% Tween/5% 
FBS for 30 min at 37 °C. Spots were developed by incubating with 100 (iL/well 1- 

10 step NBT/BCff (Pierce Chemicals) for 6-10 min. The plates were washed with water 
and allowed to air dry. The number of spots in each well was determined using a 
dissecting microscope and the data normalized to 10 cell input. 

Non-human Primate anti-RTEUSA - The pol-specific antibodies in the 
monkeys were measured in a competitive RT EIA assay, wherein sample activity is 

15 determined by the ability to block RT antigen from binding to coating antibody on the 
plate well. Briefly, Maxisorp plates were coated with saturating amounts of pol 
positive human serum (#971 11234). 250 uL of each sample is incubated with 15 uL 
of 266 ng/mL RT recombinant protein (in RCM 563, 1% BSA, 0.1% tween, 0.1% 
NaNa) and 20 uL of lysis buffer (Coulter p24 antigen assay kit) for 15 n±i at room 

20 temperature. Similar mixtures are prepared using serially diluted samples of a 

standard and a negative control which defines maximum RT binding. 200 uL/weU of 
each sample and standard were added to the washed plate and the plate incubated 16- 
24 h at room temperature. Bound RT is quantified following the procedures described 
in Coulter p24 assay kit and reported in milliMerck units per mL arbitrarily defined 

25 by the chosen standard. 

Results - Rodent Studies - B ALB/c mice (n=5 mice/cohort) were immimized 
once or twice with varying doses of MRKAd5hCMV-IApol(E3+) and 
MRKAd5hCMV-IApol(E3-). At 3 weeks after the second dose, Anti-pol IgG levels 
werc determined by an ELIS A assay using RT as a surrogate antigen. Cellular 

30 response were quantified via IFNy ELISpot assay against pools of pol-epitope 
containing peptides. Hie results of these assays are summarized in Table 10. The 
results indicate that the mouse vaccinees exhibited detectable anti-RT IgGs with an 
adenovector dose as low as 10^7 vp. The humoral responses are highly dose- 
dependent and are boostable witii a second immunization. One or two doses of ei±er 

35 pol vectors elicit high frequencies of antigen-specific CD4"^ and CDS* T cells; the 
responses are weakly dose-dependent but are boostable with a second itmnunization. 
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Table 10. LmnunogeDicity of MRKAdSpol Vectors in BALB/c mice. 











Antl-RT IgG Titers' 


SFC/IO^eeell^ 


Group 


Vaccine 


Dose 


No. of 
Doses 


GUT 


+SE 


-SE 


Uedhiin 


CD44- 
peptide 

DODl 


CD84- 
peptide 

DOOl 


1 


MRKAdShCMVFMwl (B^) 


10^7 vp 


2 
1 


310419 
819 


301765 
372 


153020 
265 


1(1) 
1(1) 


75(4) 

7m 


2313(67) 
633(41) 


2 


MRKAdShCMVFLpol (E3+} 


10«9vp 


2 
1 


163840(f 
713155 


0 

528520 


0 

30^55 


2(2) 
1(1) 


114(9) 
48(7) 


2063(182) 
733(89) 


3 


MRKA£6hCMVFM]0l (E3-} 


10^7 wp 


2 
1 


310419 
6400 


3B&218 
, 14013 


172097 
4393 


0(0) 
10(8) 


223(7) 
141(21) 


2807(27) 
409(28) 


4 


MHKAcBtCMVapOl (E3-) 


10^ vp 


2 
1 


1638400^ 


0 

396725 


D 

300661 


1(1) 

m 


160(1^ 
39(13) 


2385(11) 
633(8^ 


5 


Nah/e 


none 


none 


57 


9 


7 


9(2) 


11(4) 


10(1) 



h^ear a al the upper QmR of the serial dOufion; hens, could 

'tia Of ^ot-ftHinfrig Cells per mHfion splecnoytes; mean values d triplicates aie reported along with standard errors in parentfie^ 

C57/BL6 mice were immunized once or twice with varying doses of 
MRK:Ad5hCMV-nef(G2A,LLAA) (E3+), MRKAd5mCMV-nef(G2A.LLAA) (E3+) 
at either lO'^? vp and(3) MRKAd5mCMV-tpanef(LLAA) ^3+) at either lO'^? vp and 
10^9 vp. The immune response were analyzed using similar protocols and the results 
are listed in Table 1 1: While anti-rief IgG responses could not be detected in this 
model system with any of the constructs, there are strong indications of a cellular 
immunity generated against nef using the ELIspot assay. 



Table 1 1 . Immunogenicity of MRKAdSnef Vectors in C57/BL6 mice 











Antt-nef IgG Titers* 


SFC/10^ celte" 








No. of 


GMT 


+SE 


-SE 




aa51-70 


aa81-100 


Group 


Vaccine 


Dose 


Doses 


Medium 


CD8+ 


CD4+ 


1 


MRKAiShCMVFIjnef (E34-) 


10^7 vp 


2 


174 


70 


50 


1(1) 


23(1) 


1(1) 








1 


132 


42 


32 


0(0) 


m 


0(0) 


2 


MFIKAdShCMVFLnei (E3i-) 


10^ vp 


2 


174 


70 


50 


0(0) 


61(7) 


4(2) 




1 


132 


42 


32 


1(1) 


62(7) 


3(1) 




MRKAdSmCMVFUef (JE3*) 


10^7 vp 


2 


132 


42 


32 


3(1) 


15(5) 


5(2) 




1 


115 


46 


33 


3(2) 


3(2) 


4(2) 


4 


MFIKAdSmCMVFLnef (E3f) 


10^ VP 


2 


132 


42 


32 


4(2) 


83(13) 


5(1) 




1 


132 


42 


32 


2(1) 


29(2) 


4(0) 


5 


MRKAd5mCMVtpanef(Eaf) 


10^7 vp 


2 


132 


42 


32 


3(2) 


14(2) 


5(1) 








1 


100 


0 


0 


3(1) 


13(4) 


10(3) 


6 


MRKAd5mCMVtpanef(E3f) 


10^ vp 


2 


230 


170 


98 


3(2) 


14S(29) 


4(0) 




1 


115 


46 


33 


7(1) 


151(14) 


10(0) 


7 


NaiTve 


none 


none 


152 


78 


52 - 


21(2) 


18(8) 


26(3) 



'GMT, geometric mean titer of the cohort of 5 ndiGK SE, standard error of the gemsldc mean 

'Mo. of spot-forming cells per million splecnoytes; mean values of trfpllcates are reported along with standard errors h parenthesis. 



Monkey Studies - Cohorts of 3 rhesus macaques were immunized with 2 doses 
ofMRKAd5hCMV-IApol(E3+)andMRKAd5hCMV-IApol(E3-^^ The number of 
antigen-specific T cells (per million PBMCs) were enumerated using one of two 
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peptide pools (L and R) that cover the entire pol sequence; the results are listed in 
Table 12. Moderate-to-strong T cell responses were detected in the vaccinees using 
either constructs even at a low dose of 10^9 vp. Longitudinal analyses of the anti-RT 
antibody titers in the animals suggest that the pol transgene product is expressed 
efficiently to eHcit a humoral response (Table 13). It would appear that generally 
higher immime responses were observed in animals that received the E3- construct 
compared to the E3+ virus. 

Table 12. Pol-specific T Cell Responses in MRKAd5pol Immunized Rhesus 



Macaques. 



Vacolne frs0«4 wks) 


Monk# 


Prableed 


T=4 


Te7 


Tel 6 








Mock 


PolL 


PdR 


Mock 


PoiL 


PoIR 


Mock 


PolL 


PoIR 


Mock 


PolL 


PoIR 


IA;KAd5hCMV-IApci(E34) 


99C10D 


1 


0 


0 


1 


38 


31 


0 


52 


145 


0 


49 


715 


KMl vp 


99C215 


1 


2 


2 


10 


98 


249 


1 


109 


305 


22 


68 


250 


99D201 


5 


S 


4 


6 


149 


S5 


0 


40 


35 


0 


35 


IB 


^Rlc^d5hCMv^Apd(E34) 


99D212 


0 


2 


0 


4 


331 


114 


0 


58 


14 


0 


6 


6 


1CK9vp 


990180 


0 


4 


2 


0 


19 


192 


4 


36 


156 


5 


38 


106 


99C2D1 


8 


5 


21 


8 


62 


62 


0 


18 


32 


1 


14 


65 




99D239 


5 


2 


2 


20 


62 


172 


1 


68 


114 


9 


21 


40 


lONlvp 


99C166 


4 


12 


8 


5 


120 


421 


2 


271 


489 


16 


875 


530 


990084 


1 


8 


9 


8 


84 


484 


0 


14 


236 


1 


24 


264 


M%KA£hCMV-IApciCE3-) 


0C7C 


10 


10 


B 


12 


724 


745 


4 


322 


376 


4 


168 


176 


10^ vp 


CDIG 


2 


0 


1 


5 


474 


468 


0 


232 


212 


0 


101 


121 


con 


6 


6 


12 


10 


98 


110 


5 


€0 


80 


6 


25 


34 


Ndve 


083Q 


nd 


nd 


nd 


nd 


nd 


nd 


4 


2 


2 


2 


1 


2 



nd, not ftuiemilned 

Reported are SF€ per n^ion PBMCs; mean at di^Alefllo wbAl 



Table 13. Anti-RT Ig Levels in MRKAdSpol Immunized macaques. 



RT ANTIBODY ASSAY TITERS INfflMUAnL 








Voodne^Monkey Top 


T=4 


Ta? 


Ta:12 


Tsl6 


MRKAd5hCMV-4ADOl(E3+). lO^'ll VD 










99C100 


61 


1999 


5928 


4768 


99C215 


81 


1541 


2366 


2767 


99D201 


63 


336 


539 


387 












MRKAd5hCR/lV-lADOl(E3+). 10'^9vD 










99D212 


10 


40 


49 


68 


99D180 


<10 


36 


79 


93 


99C201 


<10 


37 


71 


76 












MRKAcfihCMV-IAoolffiS-). lO^'ll VD 










99D239 


44 


460 


1234 


1015 


99C186 


21 


• 233 


480 


346 


99C084 


235 


2637 


2868 


1626 












MRKAd5hCMV-IADol(E3-). 10*9vD 










0C7C 


32 


175 


306 


235 


CDIG 


20 


140 


273 


419 


CDll 


16 


112 


149 


237 
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When rhesus macaques weie immunized i.m. with two doses of MRKAdSnef 
5 constructs, vigorous T cell responses ranging &om 100 to as high as 1 100 per million 
were observed in 8 of 12 vaccinees (Table 14). The efficacies of the mCMV- and 
hCMV- driven nef constructs are comparable on the basis of the data generated thus 
far. 

10 Table 14. Nef-spedfic T cell Responses in MRKAdSnef Immunized Rhesus 

Macaques. • " • -•' -• 



Vaccine (T=0,4 wks) 


Monk# 


Pre 


T=4 




=7 


T= 


16 






Mock 


Nef 


Mock 


Nef 


Mock 


Nef 


Mock 


Nef 


MRKAd5hCMV-nei(Q2A,LLAA) (E3+) 


CD20 


0 


4 


31 


440 


4 


368 


1 


251 


10^11 vp 


CC7B 


0 


0 


2 


521 


0 


178 


1 


1522 




CC61 


2 


9 


31 


112 


0 


108 


11 


100 


MRKAd5hCMV-nef<Q2A,LLAA) (E3f} 


CC2K 


9 


9 


6 


52 


0 


35 


0 


15 




. CD15 


5 


4 


30 


998 


2 


586 


0 


434 


C016 


6 


1 


6 


1146 


0 


369 


1 


212 


MRKAd5iTiCMV-nef(G2A.LLM} (E3+) . 


99D191 


1 


5 


4 


614 


0 


298 


2 


419 


10*11 VP 


99D144 


4 


6 


5 


434 


0 


1100 


2 


932 


99C193 


1 


2 


1 


58 


1 


22 


0 


64 


MRKAd5iTiCMV-nef(G2A,LXAA) (E3f} 


99D224 


1.. 


JI1 


14. 


231 


1 


125 


0 


70 


lO^vp 


99D250 


B 


9 


4 


108 


0 


54 


0 


5 




99C120 


1 


6 


20 


299 


0 


92 


0 


79 


NaTve 


083Q 


nd 


nd 


IB 


22 


4 


5 


2 


1 



EXAMPLE 25 

15 Comparison of Clade B vs. Clade C T Cell Responses in HIV-infected Subjects 
PBMC samples collected from two dozens of patients infected with HIV-1 in 
US were tested in ELISPOT assays with peptide pools of 20-mer peptides overlapping 
by 10 amino acids. Four different peptide pools were tested for cross-clade 
recognition, and they were either derived from a clade B-based isolate (gag H-b; nef- 

20 b) or a clade C-based isolate (gag H-c, nef-c). Data in Table 15 shows that T cells 
from these patients presumably infected with clade B HIV-1 could recognize clade C 
gag and nef antigens in ELISPOT assay. Correlation analysis further demonstrated 
Aat these T cell responses against clade C gag peptide pool were about 60% of the 
clade B counterpart (Figure 24), while the T cell responses against clade C nef were . 

25 about 85% of the clade B counterpart (Figure 25). These results suggest that cellular 
immune responses generated in patients infected with clade B H[V-1 can recognize 
gag and nef antigens derived from clade C mV-L These data show that a HIV 
vaccine, such as a DNA or MRKAd5'-based adenoviral vaccine expressing a clade B 
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gag and/or nef antigen wOl potentially have the ability to provide a prophylactic 
and/or therapetic advantage on a global scale. 



Table 15 

Responses Shown as the Number of gIFN-Secreting T Cells per Million 
PBMCs . 



subject 


bleed date 1 aaa epitope # I mock 


gag H-b 


gagH-c 




nef-c 




( from mapping) 










«100 


19-JUI-9P 


12 


10 


3950 


1385 


1295 


1300 


»101 


25-JUI-99 


3 


15 


3885 


1280 


na 


1020 


#102 


2S-JUI-99 


4 


15 


1740 


850 


1255 


1785 


«104 


7-Jun-99 


2 


5 


1355 


1185 


na 


1060 


#107 


ll-Oct-99 


2 


25 


3305 


2795 


670 


870 


#405 


ll-Jul-99 


2 


15 


4575 


3180 


1700 


1500 


#501 


l9-Jul-99 


2 


15 


1100 


570 


3365 


3460 


#505 


I8-Jui-99 


5 


10 


2145 


1725 


1235 


na 


#506 


28-l^b-99 


2 


25 


150 


45 


400 


610 


#701 


28-Mar-99 


5 


30 


7620 


4775 


3320 


2780 


#709 


17-May-99 


3 


15 


2785 


1945 


1090 


1630 


#710 


24-May-99 


4 


5 


1055 


1080 


2210 


2140 



















EXAMPLE 26 

Characterization and Production of MRKAd5pol and MElKAdSnef 

Vectors in Roller Bottles 



Ejq>ansion of nef and pol Adenovectors - Nef and pol CsCl purified MRKAdS 
seeds were used to infect roller bottles to produce P4 virus to be used as a seed for 
further experiments. P4 MRKAdS pol and nef vectors were use4 to infect roller 
bottles at an MOI 280 vp/cell, except for hCMV-tpa-nef [E3+] which was infected at 
an MOI of 125 due to low titers of seed obtained at P4. 



Table 16 Viral particle concentrations for P5 nef and pol adenovectors 



Adenovecjtor 


AEX Titer 
(10'° vp/ml culture) 


AEX Titer 
(10%/ceU) 


Amplification 
Ratio 


hCMV-FL-nef [E3+] 


1.1 


0.9 


30 


mCMV-EL-nef [E3+] 


2.2 


2.1 


75 


hCMV-tpa-nef [E3+] 


0.07 


0.1 


5 


mCMV-tpa-nef IE3+] 


1.3 


0.9 


35 


hCMV-FL-pol [E3+] 


2.7 


2.1 


75 


hCMV-EL-pol [E3-] 


1.9 


1.3 


45 



91 



wo 02/22080 



PCT/USOl/28861 



10 



Roller Bottle Passaging - Passaging of the pol and r^f constructs continued 
through passage seven. Cell-associated (freeze/thaw lysis) and whole brodi (triton- 
lysis) titers obtained in all passages weic very consistent In general, MRKAdSpol is 
ca. 70% as productive as MRKAdSgag while MRKAdSnef is ca. 25% as productive 
as MRKAdSgag. Samples of P7 virus for both constructs were analyzed by V&CB by 
restriction digest analysis and did not show any rearrangements. 



Table 17. Passage Six Viral Productivity for MRKAd5pol and MRKAdSnef 





Xviabteai 
\riafaif 

Infection 


ycellsAnl). 
Harvest 


Cell Passage 
Number 


AEX Titer 
(Cell A^sodaied) 
lO^vp/mlcnltnre 


Titar 

10* vp/ccll 


AmpHfiralinn 

Ratio 


TritoQ Lysis Titar 
10" vpAml cnltnie 


hCKfV-FIrnBf[B3f] pool 


1.22,85% 




62 


0.8 


0.7 


25 


1.6 


1 

2 




0.99, 62% 
1.10.72% 












hCMV-FLi>o] [E34-] pool 


1.42, 89% 




62 




3.2 


115- 




1 

2 




1.22,70% 
1.42,74% 













IS Table 18. Passage Seven Viral Productivity 



for MRKAdSpol and MRKAdSnef 





XviablB(ll 
Vtabit 
jbifecticm 


fceOsfnS^, 
tty(%) 

Harvest 


Cell Passage 
Nnciber 


AEX Titer 
(Cell Associated) 
lO*" vp/ml caltnie 


T%er 
lO^vp^ceO 


Amplificadoa 
Ratio 


IHloB Lysis Titer 
10*° vpfnl calture 


hCMV-FL-ncf [E3+-] Ptool 


133.90% 




66 


1.0 


as 


29 




1 

2 




0^.70% 
1.18,73% . 












hCMV-FLpol IB3+I Pool 


0.90*. 90% 




56 


4.2 


4.7 


168 




1 
2 




1.18,88% 
1.04.80% 













20 



25 



MRKAdSnef and MRKAdSpol Viral Production Kinetics - A timecourse 
experiment was c:anied out in roller bottles to determine if the viral production 
kinetics of the MRKAdSpol and MRKAdSnef vectors were similar to those of 
MRKAdSgag. PER.C6® cells in roller botde cultures were infected at an MOI of 280 
vp/cells with PS MRKAdSpol, PS MRKAdSnef and P7 MRKAdSgag; for each 
adenovector, two infected bottles were sampled at 24, 36, 48, and 60 hours past 
infection. In addition, two bottles were left unsampled until 4S hpi when they were 
harvested under the Phase I process conditions. The anion-exchange HPLC viral 
particle concentrations of the fteeze-thaw recovered cell associated virus at the 24, 36, 
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48, and 60 hpi timepoints are shown in Figure 29A-B. The QPA titers show a similar 
trend (data not shown). 

Comparison ofhCMV- and mCMV-FL-nef - As (he titers obtained with the 
5 MRKAdSnef constract (hCMV-FL-nef) were lower than those obtained with 
MRKAdSjgag or MEXAdSpol, a viral productivity comparison expemnent was 
performed with mCMV-FL-nef . For each of the two adcnovcctors (hCMV- and 
mCMV-FL-nef)> two roller bottles were infected at an MOI of 280 vp/cell with 
passage five clarified lysate. The macroscopic and microscoptc observations of the 
10 four roller botdes were identical at the time of harvest. Analysis of the clarified lysate 
produced indicated a higher viral particle concentration in the botdes infected widi 
meMV-FL-ncf, as shown in Tabic 19. It is stipulated that the higher productivity with 
mCMV promote driven nef vector is due to lower nef ^pression levels in PER,C6® 
cells- experiments are underway at V&CB to measure nef expression levels. 

15 

Table 19. Passage Six Viral Productivity Comparison of hCMV- and mCMV-FL-nef 







Xv (10^ cells/ml). Viability (%) 


Cell Passage 


AEX Titer 


TUer 


Amplificaiioc 


Tdton Lysis Titer 






bfectioD 


Harvest 


Nusibec 




:0*vp/oell 


Ratio 


10" vp/mlciiltUEe 


hCMV-PL-nef 


Pool 


1.11,919b 




60 


IJ5 


1.4 


SO 


2.8 


(MRKAdSnef) 


1 
2 




1.23,75% 
134.74% 












mCMV-FL-nef 


Pool 


l.ll,9I9fc 




60 


23 


Zl 


75 


4.6 




1 




l.4».84% 














2 




1.18.77% 













20 EXAMPLE27 

Characterization and Large Scale Production of 
MRKAdSnef Virus in Bioieactors 
Materials and Methods - The experiment of the present example was run twice 
under the following conditions: 36.5°C, DO 30%, pH 7.30, ISQrpm agitation rate. 
25 no spargjng, life Technologies (CHbco, Invitrogeo) 293 SFM n (with 6n^ 

glutamine), 0,5M NaOH as base for pH control. During the first mn (B20010115), 
two lOL stirred vessel bioreactors were inoculated with PER,C6® cells at a 
concentration of 0.2x10^ ceDs/ml. Cells were grown until they reached a cell 
concentration of approximately 1x10^ cells/ml. The cells were infected with uncloned 
30 MRKAdSnef (G2A,LLAA) at k MOI of 280 virus particles (vp)/cell. For the second 
batch (B200102Q2), the same procedure as the first run was used, except the cells 
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were infected with cloned MRAdSnef . During both runs, the bioreactors were 
harvested 48 hours post-infection. Samples were taken and virus concentrations were 
determined from whole broth (v^dth triton lysis), supernatant, and cell pellets (3 X 
freeze/thaw) with the AEX and QPA assays. Metabolites were measured with 
5 BioProfile 250 throughout the process. 



Table 20: Experimental Conditions 



Temperature 




DO 


30% 


PH 


7.30 


Agitation 


150 rpm 


Sparginfi 


None 



Table 21 : Virus source used for experimsats. 

10 



Ron 


Batch ID 


ClooedAJncloncd 


MOI 






MR£Ad5nef 


(vp/cells) 


#1 


B20010115-1 


Uncloned 


280 




320010115-2 


Uncloned 


280 


#2 


B2001Q202-1 


Cloned 


280 




B20010202-2 


Qoned 


280 



Results - Table 22 and 23 show an the ability to scale up production of 
MRKAdSnef by growth in a bioreactor. 

15 Table 22: Virus Concentration as measured by the AEX assay 



Run 


Batch ID 


QonedAJncloned 
MRKAd5nef 


Vims Concentration @ 48hpi (b 


.10*^ vp/L) 


Supernatant 


Clarified Lysate 


Total 


Triton Lysate 


#1 


B20010115-1 


Uncloned 


0.72 


3.26 


3.98 


5.76 


B20010115-2 


Uncloned 


0.38 


1.67 


2.05 


2.46 


#2 


B2001Q202-1 


Cloned 


0.80 


6.00 


6.80 


8.88 


B20010202-2 


Cloned 


OJO 


6.00 


6.50 


8.47 



Table 23: Virus Titers as measured by the QPA assay 



Run 


Batch ID 


QonedAJncloned 
MRKAd5nef 


Virus Concentration @ 48hpi ClxlO*' lU/L) 


Whole 
Broth 


Supensatant 


Qarified 
Lysate 


Total 


Triton 
Lysate 


#1 


B20010115-1 


Uncloned 


0.13 


1.12 


1.76 


2.88 


11.28 


B20010115-2 


Uncloned 


0.14 


0.73 


1^4 


2.27 


5.86 


#2 


B20010202-1 


Cloned 


0.14 


0.97 1.62 


2.69 


11.89 


B2001Q202-2 


Cloned 


0.14 


1.17 1.70 


2.97 


12.47 



20 

The present invention is not to be Umited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in 
addition to those described herein will become apparent to those skilled in the art 
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from the foregoing description. Such modifications are intended to fall within the 
scope of the appended claims. 

EXAMPLE 28 

5 MRKAdSHTV-lgag Boosting of DNA-Primed Animals 

Groups of 3-5 rhesus macaques were immunized with (a) 5 mgs of VI Jns- 
Flgag (pVnnsCMV(no intron)-FL-gag-bGHpA), (b) 5 mgs of VI Jns-Flgag 
fonnulated with 45 mgs of a non-ionic block copolymer CRL1005, or (c) 5 mgs of 
10 VI Jns-Flgag fonnulated with 7.5 mgs of CRH005 and 0.6 mM benzalkonium 
chloride at weeks 0, 4, and 8. All animals received a single dose of 10e7 viral 
particles (vp) of tiie MRKAd5HIV-lgag at week 26. Note: lOe? is too low to prime 
or boost effectively when used as a single modality (dose is selected to radmic 

■ 

preexposure to adenovirus); see Figure 32. 

15 Blood samples were collected from all animals at several time points and 

peripheral blood mononuclear cells (PBMCs) were prepared using standard Ficoll 
method. The PBMCs were counted and analyzed for gamma-interferon secretion 
using the ELISpot assay (Table 24). For each monkey, the PBMCs were incubated 
ovemight either in the absence (medium) or presence of a pool (called "gag ET') of 50 

20 20-aa long peptides that encompass the entire HTV-l gag sequence. 

The results indicate tiiat MRKAdSHTV-lgag was very effective in boosting 
the T cell immune responses in these monkeys. At week 28 or 2 weeks after the viral 
boost, the number of gag-specific T cells per million PBMCs increased 2-48 fold 
compared to the levels observed at week 24 or 2 weeks prior to the boost 

25 The PBMCs were also analyzed by intracellular gamma-interferon staining 

prior to (at week 10) and after the MRKAd5gag boost (at week 30). The results for 
select animals are shown on Figure 31. The results indicate that (a) immunization 
with DNA/adjuvant formulation elicited T cell responses which can either be 
balanced, CD4*-biased or CDS'^-biased, and (b) boosting with the MRKAdSgag 

30 construct produced in all cases a strongly CD8"*"-biased response. These results 

suggest that boosting with MRKAdSHTV-lgag construct is able to improve the levels 
of antigen-specific CD8* T cells. 
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EXAMPLE29 

Construction of gagpol fusion for MRKAdSgagpol fusion constructs 

The open reading frames for the codon-optimized HIV-1 gag gene was fused 
5 directly to the open reading frame of the lA pol gene (consisting of RT, ENAseH and 
integrase domains) by stepwise PGR. Because the gene (SEQ K) NO: 38) does not 
include the protease gene and the frameshift sequence, it encodes a single polypeptide 
of the combined size of p55, RT, RNAse H and integrase (1350 amino adds; SEQ E) 
NO: 39). 

10 The fi:agment that extends from the BstEII site within the gag gene to the last 

non-stop codon was ligated via PGR to a fragment that extends from the start codon 
of the lApol to a unique BamHf site. This fragment was digested with BstEII and 
BamH/. Construction of gag-IApol fusion was achieved via three-fragment ligation 
involving the Pst/-BstEZ/ gag digestion fragment, the BstE22/BamH/ digested PGR 

15 product and long PstZ/BamH/ VlR-FI^ol backbone fragment. 

The MRKAdS-gagpol adenovirus vector was constructed using the BgLZZ 
fragment of the VlR-gagpol containing the entire ORF of gag-IApol fusion gene. 

EXAMPLE 30 

20 Immunogenicity Studies in Non-Human Primates 

Cohorts of three (3) macaques were immunized with lOeS or lOelO viral 
particles (vp) of one of the following MRKAd5 HIV-1 vaccines: (1) MRKAdSgag; 
(2) MRKAidSpol; (3) MRKAdSnef ; (4) a mixture containing equal amounts of 
25 MRKAdSgag, MRKAdSpol, and MRKAdSnef, or (5) a mixture of equal amounts of 
MRKAdSgagpol and MRKAdSnef. The vaccines were administered at weeks 0 and 

4. 

Hie T cell responses against each of the HIV-1 antigens were assayed by lEN- 
gamma ELISpot assay using pools of 20-aa peptides that encompass the entire protein 

30 sequence of each antigen. Th& results (Table 25) are expressed as the number of spot- 
forming cells (sfc) per million peripheral blood mononuclear cells (PBMC) that 
respond to each of the peptide pools. 

Results indicate die following observations: (1) each of the single gene 
constructs (MRKAdSgag, MRKAdSpol, or MRKAdSnef) is able to elicit high levels 

35 of antigen-specific T cells in monkeys; (2) the single-gene MRKAdS constructs can 
be mixed as a multi-cocktail formulation capable of eliciting very broad T cell 
responses against gag, pol, and nef ; (3) the MRKAdS vector expressing the fusion 
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protein of gag plus lA pol is capable of inducing strong T cell responses to both gag 
and pol. 

Table 25. Evaluation of Mixtures of MRKAdS vectors e^qpressing humanized 



HIV-l gag» pol, gagpolynef in rhesus macaques 



Grp# 


Vaccine 


Monk# 


T=6 wks 




T=0, 4 v/ks 








I Pot - 1 


Pol - 2 




1 

■ 


MRKAdS aaa 


CB9V 


0 


15 










lO^IOvp 


GDI 9 


U 






m 


* 






109H 


1 










2 


MRKAdS aaa 


99D130 


1 


948 


- 


m 






i(y«vp 


W277 


To 






m 








143H 


A 










3 


MRKAdS pol 


CC1X 


4 




46 


256 


- 




lO^IOvp 


AW3W 


Q 

o 














AV43 


o 






lOOO 




4 


MRKAdS pol 


AW36 


1 


■ - 


19 


30 


- 






CC8K 


n 
u 




ou 










CC21 


1 
1 






■rww 




5 


MRKAdS nef 


076Q 


9 


- 


- 


- 


1204 




lO^IOvp 


091 Q 


4 








85 






083Q 


0 








176 


6 


MRKAdS nef 


00C029 


1 








114 




i(y>evp 


98D022 


6 








170 






98D160 


3 


m 




* 


198 


7 


MRKAdSgag+MRKAdSpot+MRKAdSnef 


99D251 


3 


206 


15 


193 


120 




10^10 vp each 


05H 


3 


135 


21 


9 


638 






00C016 


3 


26 


4 


51 


23 


8 


MRKAdSgag+MRKAdSpd+MRKAdSnef 


99D215 


1 


171 


18 


193 


240 




IQABvpeach 


81 H 


5 


73 


6 


14 


243 




12H 


8 


1140 


115 


811 


719 


9 


MRKAdSgagpot -i^RKAdS nef 


99D211 


0 


83 


56 


838 


725 




10^10 vp each 


22H 


4 


385 


119 


1194 


1915 






61H 


4 


343 


11 


765 


853 


10 


MRKAdSgagpol +MRKAd5 nef 


34H 


3 


78 


19 


5 


75 




10^ vp each 


48H 


1 


65 


105 


46 


43 




70H 


5 


158 


15 


220 


191 



Indicated are numbers of spot-forming cells per million PBMCS against the peptide pools. Mock, no peptides; gag . 

fifty 20-aa peptides encompassing p55 sequence; pol-1, 20-aa peptides representing N-tenxmial half of lA pol; 
pol'2, 20-aa peptides representing the carboxy-terminal half of lA pol; nef» 20-aa peptides encompassing the entire 
wild-type nef sequence. Responses to the antigens prior to the first immunization did not exceed 40 sfc/10^ 



PBMC 
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WHAT IS CLAIMED IS 

* 

1. A recombinant adenoviral vaccine vector at least partially deleted in 
5 EI and devoid of El activity, comprising: 

a) an adenovirus ci.9-acting packaging region corresponding to from 
about base pair 1 to between from about base pair 400 to about 
base pair 458 of a wildtype adenovirus genome; and 

b) a gene encoding an HIV protein or immunologically relevant 
10 modification thereof . 

2. A vector in accordance with claim 1 comprising a packaging region 
coiresponding to from about base pair 1 to about base pair 450 of a wildtype 
adenovirus genome. 

3. A vector in accordance with claim 1 further comprising nucleotides 
15 corresponding to between from about base pair 351 1 to about 3524 to about base pair 

5798 of a wildtype adenovirus genome. 

4. A vector in accordance with claim 3 comprising base pairs 
coiresponding to 1-450 and 3511-5798 of a wildtype adenovirus genome. 

5. A vector in accordance with claim 4 which is deleted of base pairs 

20 451-3510. 

6. A vector in accordance with claim 1 which is at least partially 

deleted in E3. 

7. A vector in accordance with claim 6 wherein the E3 deleted region 
is from base pairs 28,133-30,818. 



99 



wo 02/22080 



PCT/USOl/28861 



8. A vector in accordance with claim 1 wherein the gene encoding the 
HIV protein or modification thereof cornprises codons optimized for expression in a 
human. 

9. A vector in accordance with claim 1 wherein the vector con^aises a 
gene expression cassette comprising: 

a) a nucleic acid encoding a protein; 

b) a heterologous promoter operatively linked to the nucleic acid 
encoding the protein; and 

(c) a transcription termination sequence. 

10. A vector in accordance with claim 9 wherein the gene expression 
cassette is inserted into the El region. 

11. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette is in an El parallel orientation 

12. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette is in an El antiparallel orientation. 

13. An adenoviral vector in accordance with claim 9 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

14. An adenoviral vector in accordance with claim 13 wherein the 

promoter is an immediate early human cytomegalovirus promoter. 

,■• 

15. An adenoviral vector in accordance with claim 9 wherein the 
promoter is a murine cytomegalovirus promoter. 

16. An adenoviral vector in accordance with claim 9 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 
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17. An adenoviral vector in accordance with claim 9 wherein the 
transcription termination sequence is a synthetic polyadenylation signal (SPA). . 

18. A cell comprising the adenoviral vector of claim 1. 

19. Recombinant, replication-defective adenovirus particles harvested 
5 and purified subsequent to transfection of the adenoviral vector of claim 1 into a cell 

line which expresses adenovirus El protein at complementing levels. 

20. An HIV vaccine composition comprising purified adenovirus 

particles of claim 19. 

21. An HIV vaccine composition of claim 20 which comprises a 

10 physiologically acceptable canier. 

22. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 
claim 1 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 

15 replication-defective adenovirus. 

23. A method according to claim 22 wherein the cell is a PER.C6® 

cell. 

24. A method of generating a cellular-mediated immune response 
against HEY in an individual comprising administering to the individual a vaccine of 

20 claim 21. 

25. A method according to claim 24 which further comprises 
administration to the mdividual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 
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26. A method according to claim 25 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 

27. A method according to claim 24 wherein the adenovirus vaccine is 
5 preceded by an adenovirus vaccine of a different serotype. 

28; A method according to claim 24 which comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

29, An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HIV gag or an immunologically relevant modification thereof. 
10 30. An adenoviral vector in accordance with claim 9 wherein the gene 

expression cassette comprises an open reading frame encoding an HTV gag protein or 
immunologicaUy relevant modification theieof . 

31. A recombinant adenoviral vaccine vector at least partially deleted 
in El and devoid of El activity, comprising: 
15 a) an adenovirus cw-acting packaging region corresponding to from 

about base pair 1 to about base pair 450 and from about 351 1 to about 

5798 of a wildtype adenovims genome, and deleted for base pairs 

corresponding to from about base pair 451 to from about base pair 

3510 of a wildtype adenovirus genome; and 
20 b) a gene expression cassette comprising 

i) SEQIDNO:29; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcription teiminadon sequence. 
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32. An adenoviral vector in accordance with claim 31 wherein the 
gene expression cassette is in an El parallel orientation, 

33 An adenoviral vector in accordance with claim 31 wherein the g^e 
expression cassette is in an El antiparallel orientation. 
5 34. An adenoviral vector in accordance with claim 31 wherein the 

promoter is a cytomegalovirus promoter devoid of intronic sequences. 

35. An adenoviral vector in accordance with claim 31 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transcription termination sequence. 
10 36. An adenoviral vector in accordance mih claim 31 which is at least 

partially deleted in E3. 

37. A cell comprising the adenoviral vector of claim 30. 

38. Recombinant, replication-defective adenovirus particles harvested 
and purified subsequent to transfection of the adenoviral vector of claim 30 into a cell 

1 5 line which expresses adenovirus El protein at complementing levels. 

39. An mv vaccine composition comprising purified adenovirus 

particles of claim 38. 

40. An HIV vaccine composition of claim 39 which comprises a 

physiologically acceptable carrier. 
20 4 1 . A method of producing recombinant, replication defective 

adenovirus particles containing the adenoviral genome of the adenoviral vector of 

* 4 

claim 30 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, and harvesting the resultant recombinant, 
replication-defective adenovirus. 
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42. A method according to claim 41 wherein the cell is a PER,C6® 

cell. 

43. A method of generating a ceUular-mediated immune response 
against HIV in an individual comprising administering to the individual a vaccine of 

5 claim 21. 

44. A method according to claim 43 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 

10 4S. A method according to claim 44 wherein the DNA plasmid 

vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 

46. A method according to claim 43 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 
15 47. A method according to claim 43 which comprises administering 

and readroinistering the adenovirus vaccine vector to the individual. 

48. An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HTV pol or an immunologically relevant modification thereof. 

49. An adenoviral vector in accordance with claim 9 wherein the gene 

k ' • " 

20 expression cassette comprises an open reading frame encoding an HTV pol protein or 
immunologically relevant modification thereof. 

50. A recombinant adenoviral vaccine vector at least partially deleted 

in El and devoid of El activity, comprising: 
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a) an adenovirus cw-acting packaging region corresponding to from 

■ 

about base pair 1 to about base pair 450 and from about 3511 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to from about base pair 451 to from about base pair 
3510 of a wildtype adenovims genome; and 

b) a gene expression cassette comprising 

i) a nucleotide sequence selected the group consisting of 
SEQ ID NO: 1, SEQ ID NO: 5 and SEQ ID NO: 7; 

ii) a heterologous promoter operatively linked to i); and 

iii) a transcnption termination sequence. 

51. An adenoviral vector in accordance with claim 50 wherein the 
gene expression cassette is in an El parallel orientation. 

52. An adenoviral vector in accordance with claun 50 wherein the 
gene expression cassette is in an El antiparallel orientation. 

53. An adenoviral vector in accordance with claim 50 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 

54. An adenoviral vector in accordance with claim 50 wherein the 
transcription termination sequence is a bovine growth hormone polyadenylation and 
transoiption termination sequence. 

55. An adenoviral vector in accordance with claim 50 which is at least 
partially deleted in E3. 

56. A cell comprising the adenoviral vector of claim 49. 
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57. Recombinant, replication-defective adenovirus particles harvested 
and purified subsequent to transfection of the adenoviral vector of claim 49 into a cell 
line which expresses adenovirus El protein at complementing levels. 

58. An mv vaccine composition comprising purified adenovirus 
5 . particles of claim 57. 

59. An HTV vaccine composition of claim 58 which comprises a 
physiologically acceptable carrier. 

60. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 

10 claim 49 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protem, and harvesting the resultant recombinant, 
rephcation-defective adenovirus. 

61. A method according to claim 60 wherein the cell is a PER.C6 

ceil. 

15 62. A method of generating a cellular-mediated immune response 

against HIV in an individual comprising administering to the individual a vaccine of 
claim 59. 

63. A method according to claim 62 which further comprises 
administration to the individual a DNA plasmid vaccine, optionally adnoinistered with 

20 a biologically ejBFective adjuvant, protein or other agent capable of increasing the 
immune response, 

64. A method according to claim 63 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 
vaccine. 
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65. A method according to claim 62 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

66. A method accoiding to claim 62 wMch comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

67. An adenoviral vector in accordance with claim 1 wherein the HIV 
protein is HIV nef or an immunologically relevant modification thereof. 

68. An adenoviral vector in accordance with claim 9 wherein the gene 
expression cassette comprises an open reading firame encoding an HTV nef protein or 
immunologically relevant modification thereof. 

69. A recombinant adenoviral vaccine vector at least partially deleted 
in El and devoid of EI activity, comprising: 

a) an adenovirus cw-acting packaging region corresponding to from 
about base pair 1 to about base pair 450 and from about 3511 to about 
5798 of a wildtype adenovirus genome, and deleted for base pairs 
corresponding to from about base pair 45 1 to firom about base pair 
3510 of a wildtype adenovirus genome; and 

b) a gene expression cassette comprising 

i) a nucleotide sequence selected the group consisting of 
SEQIDNO: 9,SEQIDNO: I1,SEQIDN0: 13 and 
SEQ ID NO: 15; 

ii) a heterologous promoter op^tively linked to i); and 

iii) a transcription termination sequence. 

70. An adenoviral vector in accordance with claim 69 wherein die 
gene expression cassette is in an El parallel orientation. 
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7 L An adenoviral vector in accordance with claim 69 wherein the 
gene expression cassette is in an El antiparallel orientation. 

72. An adenoviral vector in accordance with claim 69 wherein the 
promoter is a cytomegalovirus promoter devoid of intronic sequences. 
5 73. An adenoviral vector in accordance with claim 69 wherein the 

transcription termination sequence is a bovine growth hoimone polyadenylation and 
transcription termination sequence. 

74. An adenoviral vector in accordance with claim 69 which is at least 
partially deleted in E3. 
10 75. A cell comprising the adenoviral vector of claim 68. 

76. Recombinant, replication-defective adenovirus particles harvested 
and purified subsequent to transfection of the adenoviral vector of claim 68 into a cell 
line which expresses adenovims El protein at complementing levels. 

77. An HIV vaccine composition comprising purified adenovirus 
15 particles of claim 76. 

78. An HIV vaccine composition of claim 77 which comprises a 
physiologically acceptable carrier. 

79. A method of producing recombinant, replication defective 
adenovirus particles containing the adenoviral genome of the adenoviral vector of 

20 claim 68 which comprises introducing the adenoviral vector into a host cell which 
expresses adenoviral El protein, arid harvesting tiie resultant recombinant, 
replication-defective adenovirus. 

80. A metiiod according to claim 79 wherein the ceU is a PER.C6® 

cell. 
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81. A method of generating a cellular-mediated immune response 
against HIV in an individual comprising administering to the individual a vaccine of 
claim 78. 

82. A method according to claim 81 which further comprises 

5 administration to the individual a DNA plasmid vaccine, optionally administered with 
a biologically effective adjuvant, protein or other agent capable of increasing the 
immune response. 

83. A method according to claim 82 wherein the DNA plasmid 
vaccine is administered to the individual prior to administration of an adenovirus 

10 vaccine. 

84. A method according to claim 81 wherein the adenovirus vaccine is 
preceded by an adenovirus vaccine of a different serotype. 

85. A method according to claim 81 which comprises administering 
and readministering the adenovirus vaccine vector to the individual. 

15 86. A multivalent adenovirus vaccine composition comprising 

recombinant, replication-defective adenovirus particles, wherein the adenovirus 
particles are harvested and purified from a cell line expressing adenovirus El f)rotein, 
and wherein the particles are harvested subsequent to transfection of the cells with an 
adenoviral vector or vectors in accordance with claim 9; said vector(s) comprising a 

20 gene expression cassette or cassettes comprising nucleotide sequences encoding HIV 
proteins selected from the group consisting of: 

* 

a) gag, pol, and nef , expressed independently from three individual 
vectors; 
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b) gag, pol, and nef , expressed independently from one vector with 
the encoding nucleic acid sequences bperatively linked to distinct 
promoters and transcription termination sequences; . 

■ 

c) gag, pol, and nef , expressed via two vectors, one expressing a pol- 
S nef fusion, and another expressing gag; 

s 

d) gag, pol, and nef, expressed via two vectors, one expressing a gag- 
pol fusion and another expressing nef; 

e) gag, pol and nef, expressed via two vectors, one expressing a nef- 
gag fusion and anoth^ expressing pol; 

10 f) gag, pol, and nef, expressed via one vector expressing a gag-pol- 

nef fusion; 

g) gag and pol, expressed independently fh)m two individual vectors; 

h) gag and pol, expressed independently fiom one vector with the 
encoding nucleic acid sequences operatively linked to distinct 

15 promoters and transcription termination sequences; 

i) pol and nef, expressed independently from two individual vectors; 
j) pol and nef, expressed independently from one vector with the 

encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 
20 k) nef and gag, expressed independently from two individual vectors; 

1) nef and gag, expressed independently from one vector with the 
encoding nucleic acid sequences operatively linked to distinct 
promoters and transcription termination sequences; 

m) gag and pol, expressed via one vector expressing a gag-pol fusion; 
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n) 



pol and nef , expressed via one vector expressing a pol-nef fusion; 



and 



o) 



nef and gag, expressed via one vector expressing a nef-gag fusion. 
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A multivalent adenovirus vaccine coniiposition in accordance with 



5 claim 86 wherein the gag-pol fusion consists of SEQ ID NO: 39. 

88. A multivalent adenovirus vaccine composition in accordance with 
claim 86 wherein the fused sequences have the encoding nucleic acid sequences 
operatively linked to distinct promoters and transcription termination sequences. 

89. A multivalent adenovirus vaccine composition in accordance with 
10 claim 86 wherein the fused sequences have the encoding nucleic add sequences 

operatively linked to a single promoter, and the encoding nucleic add sequences 
operatively linked by an internal ribosome entry sequence.("IRES"). 
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OriQ'mat Adenovector Construct: 




El- 
Figure 1: Original HIV-l gag adenDvector. 
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Seauenee of the open readino frame for FL*qbq fhuman codon optimized] 



atgggtgctagggcttrtgtgctgtrtggtggigagciggacaagtgggagaagatcaggctgaggcctggl^ 

caagaagaagtdcaagctaaagcacattgtgigggoctccagggagctggagaggtttgctgtgaacDCtggc 

dgclggagaa:trigagggg1gcaggcagatcctgggxagrtCMgocxJlD«:t9«^ 

agdgaggtcc^tgtacaacacaglggcrtacoclgtactgtgtgcaccagaagattgatgtgaaggacao^ 

gaggccctggagaagangaggaggagcagaacaagtccaagaagaaggcccagcaggctgctgclggc 



gcc3tcrta««®ga(»;tgaat9ccrtgggtgaagglgglggaggagaaggc«ttcl^ 

catgttrtrtgccdgtctgaggglgccaaxxsccaggacclgaacaccatgd 

aggclgc^tgc3gatgclgaaggagaccatcaatgaggaggctgctgaglgggacaggclgcatoctgtgc 

ac:gctggax:cattgccx«^ggccaga1gagggagcccaggggctctgacattgc1ggra^ 

ccaggagcagattggc1g9Btgaccaacaaco(XJCccatc«Jctgtgggggaaatctara^ 

c«1gggcclgaacaagatlV^gaggalgtactc«x?caw:tccatcctggacatcaggcagggc^^ 

cccttcagggactatgtggacaggttctacaagaccclgagggctgagcaggcctcccaggaggtgaagaact 

ggalgacagagaccsctgctggtgragaalgcxaacc^lgactgcaagaccatcctgaaggcc^ 

ctgc^occtggaggagatgatgaQagcctgcoagggggtggggggccctggtcaoaaggccagggtgclg 



gaagacagigaagigcttcaactglggcaagglgggccacattgccaagaactgtagggcccccaggaaga 

agggcigclggaaglgtggcaaggagggcicaccagatgaaggactgcaalgagaggcaggccaacttcctg 

ggcaaaalclggc^lcccacaagggcaggoctggcaacncctw^agtcraggcctgagccxiacagcccct 

cccgaggagtccttcaggtttggggaggagaagaccaoccccagccagaagcaggagcccattgacaagg 

agclglacKxctggDCtcxjdgagglccctgWggcaacgacc«Jlcctca;agtaaaalaaag^^ 

gat (SEQIDNO:29) 
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Old Transgene: 



hCMV 



InirDn A 



OAG 



&0H 
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Figure 3: Diagrammatic representation of the original gag transgene aind the series of 
new transgene constructions. 
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Figure 4: Modif icalions made to the current adenovector backbone in the generation of the new 
vector. 
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Figure 5: Virus mixing experiments to delermine the ef ects of the addrtion made to the packaging signal 
region (Expl #1 ) end anai^'sis of the effects o1 the E3 gene on viral growth (Expl. *2). The red bars denote 
the region of modHi^tions made to the El deletion. 
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Figure 6: AutoradiDgraph o1 viral DNA anafysis following viral mixing experiments (expts. #1 
and <f2} as detaiied in the text. 
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Raure 7A: hCMV-FLoao-bGHpA adenov&ctors constructed wrthin the -MR»C backbone. El 
paralte! and E1 antlparallel transgene orientation wtthin the E3- and E3^ backbones were 
construdBd. 
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PiQure 7B- hCMV-FLoao-SPA adenovedors constructed vmh the "MRK" backbone. El 
^relief and E1 anttparallei uansgene orientalion wHhm the E3- and backbones ware 
constructed. 



1 



8/144 



wo 02/22080 



PCT/USOl/28861 



ITR, o, P* CAS »nCVV mu 




Figure 7C: mCMV-FLgag-bGHpA adenoveclors eonsti^ucled within the*MRIC backbone. £1 
parallel and E1 enliparaliel transgene orientation within the E3- and £3^- bacicbones were 
ccKistructed 
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ngure 8A: Effect of transgene orientation 
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Figure BB: Effect of polyadenylation signal 
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Figure 9: Viral DNA from the four Adgag candidates at P5. folbwfiTg BsfEII digestion. 
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Figure 10: Viral DMA analysis of passage 11 and/or 12 of MRKHVE3. MRKAd^ag and 
MRKAdSgagCES-}. 
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Figure 11: Viral DMA analysis (H/ndlll digestion) of passage 6 MRKHVE3 and MRKAdSgag used 
to initiate the virai competition study. Last two lanes are passage 1 1 analysis of duplicate 
passages ol the competition study (each virus at MO) 26D vp). 
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Figure 12: Viral DNA analysis by H/ndlll digestion on high passage numbers for MRKAdSgag in 
seruTTi containing media with cofleclions made at specKled times. The frst iane shows the 1 Kb 
DNA size marker. The other lanes represent pre-plasmld control (digested with Pad and H/ndlll). 
and MRKAdSgag vinjs continually passaged to PI 6, P1B and P21 (serum containing media). 
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Figure . Serum ami-p24 Levd$ ai 3 Wks post i.m. immunizaiion of balb'c mice (n=10) 
wiih Vamng Doses of Several Adgag constructs: (A) MRKAdSgag (ihrougb passage 5): 
(B) NdJyi^dS E3' bCMV.FLgag-bGHpA: (C) MRJCAd5 bCMV^FLgag-SPA; (D) 
MRKAd5 E3" mCMV-FLgag-bGHpA; (D) research Loi (293 cell-derived) of AdSHIV- 
]gas; and (F) clinical lot (Ad5gagFN0001) of AdSHIV-lgag. Reponed are the geometric 
mean liiers (GMT) for eacb cohort. 
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FIGURE 16 
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AGAlCTACCATGG(mATCT0CCCCATTGAGACTGTGCCTGTGAAGCTGWDCCTmTGGA7K;CC(^^ 
STTT lielAloProlleSerProlleGluThrVolProVolLysLeulysPfoGlyMelAspGlyProlysVolLy 
I 10 20 ■ • 

GCAGTGGCCCCTGACTGAGCAGAACATCAAGGCttlGCTGGAAATCTGCAClGAGATGGAGAAGGADCGCAAAA^ 
sG InTr pProLeuThrG I uG I u^ys 1 lelysA I oLeuVo IC I ul I eCysThrC I uMbIG I uLysC I iC I yLys 1 leSer L 

30 40 50 

AGATTGGCOCGAGAACCCCTACAACACCCCTGTGTTTGaATCAADAAGAAGGACTCCAaAAGTG^^ 
ysIleClyProGluAsnProTyrAsnThrProVolPheAlolleLysLysLysAspSerThrLysTrpArgLysleuVol 

60 70 

GACTTCmAGCTGAA(yy«AGGACCCAGGACT7ClGGGAGGTGCAGCTGGGCATCCCCCAC^ 
AspPheArgG I uLeuAsnLysAr gThrG InAspPheTrpC luVo IG I nLeuG 1 yl leProHi sProAloD I yLeuLysLy 

BO 90 ^00 

CAAGMGTClGTGAClGTGClGGCTCTGGGGGATGaTACTlCTCTGTGCCCCTGGATGAGCACTTCAGGAAGTACAClC 
sLvsLvsSerVolThrVolLeuAloVolGlyAspAloTyrPheSerVolProLeuAspGluAspPheArgLysTyrThrA 

110 120 130 

CCTTCACCAircCCTCCATCAACAATGAGACCCCTGGCATCAGGTACCAGTACAATGTGCTGCCCCADGGCTGGAAGGGC 
I oPheTnr J I ePr oSer 1 1 eAsnAsnG I uThrProC I y 1 1 cArgTyrG I nTyr AsnVo I LeuPr oG I nG I yTrpl ysG I y 

140 150 

TCCCCTGCCATCTTCCAGTCCTCCATGACCAAGATCCTGGAGCCCTTCAGGAAGCAGAACCCIGACATTGTGATCTACCA 
SerProAloIlePheGlnSerSerMelThrLysl leLeuGluProPheArgLysGlnAsnProAspI leVol I lelyrGI 
160 170 '80 

CTACATCGCTGCCClGTATGTGGGClCTGACCTGGADAnGGGCAGCACAGGACCAWSATTGAGGAGCTCAGGCACCACC 
nTyrMe lAloMoleuTy r Vol G t ySer A5pLei£ I ul leC lyG I nHis^^^^ 

TGCIGADGltrnJCCIGACCACCaiGACAAGAAKADCAGAAfiGAGCC^ 
eiieuArgTrpGlyLeuThrthrProAspLysLysHlsGlnLysGluProProPheLeulrpMelGlyTyrCluleuHis 

220 230 

CCCGACm7G(yClGlGCAXCCAT7GTGCTGCClGACAAGGACTCCT0GACTGTGAA16ACATC^ 
ProAspLysTrpThrVolClnProlleVolLeiiProGluLysAspSerTrpTbrVolAsnAspIleGlnLysLeuYolGl 

240 250 260 

CAADCTGAACTl^CCT(XCAAAlCTACCCTGGWlCAAGCTGAGGCA(XTCTGCAADC7a^^ 
yLysLeuAsnlrpAloSBrGlnlleTyrProGlylleLysVolArgGlnLeuCysLysLeuLeuArgGlyThrlysAloL 

270 280 290 



FIGURE HA 
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7GACTGAGGTGA1CCCCCTGACTGAGGAGGCTGAGCTGGAGC7GGC1GAGWCAGGCAGATCCTGAAGGAGCCTGTGCAT 
[ uThrG I uVo 1 1 1 ePr oLe ulhrG I uG I uA I oG I ul euG I uL e uA I oC I uAsnAr gG I u 1 1 eL euL ysG I uPr oVo I H i s 

300 310 

K^TGIACTATGACCCCTCCAADGACCTCATICCTGACATCXAGAAGCADGGCC^ 
GlyVolTyrTyrAspPrpSerLysAspLeul leAloGluI leGinlysGlnGlyClnGlyGlnTrpThrTyrGlnl leTy 

320 330 .340 

CCAGGADCIXTTCAAGAAlXTGAAGACTGGCAAGTATGttAGGATGAOCGGOGCCW 
rGlnDloProPheLysAsnLcuLysThrBlyLysTyrAloArgMelArgClyAloHisThrAsnAspVollysGInLeuT 

350 360 370 

Ci:-AGGCTGTGCAGAAGATCACCACTCAGTCCATTCTCAlCT(W»CAAGAmCAAGTTCAArcTBC^^ 
hrGluAloVolGlnLysIleThrThrCluSerlleVollleTrpGlyLysThrPrelysPhcLysLeuProlleGlnLys 

3S0 390 

GAGACCTCGGAGACCTGGTKACTGAGTACTGGCADGOCACCTGGAlCaTCAGTGGGAGTllGTGAACACCCCCG^ 
GluThrTrpGluThrfrpTrpThrGluTyrTrpGlnAloThrTrplleProCluTrpGluPheVolAsnThrProProLe 
400 410 420 

GG1GAAGCTGTGG1ACCAGCTGGAGAAGCAGCCCA71GTGGGGGCTGAGACCTTCTATGTGGC7GGGGCTGCCAACAGGG 
uVolLysLeulrpTyrGlnLeuGluLysGluProIleVolGlyAIoGiuThrPheTyrVolAloGlyAloAloAsnArgG 

430 440 .450 

ffiACCAAGClGGGCAAGGCTCGClATGTGACCAACAGGCGCAGGCAGAAGGiGGTGACCCTGACTGACACCACCAAC^ 
I uThr L ysLeuG I vLysA I oG I yTy rVo I Thr AsnAr gG i y Ar gG I nLysVo I Vo I ThrLeuThr AspThrThr AsnG I n 

460 470 

AAGACTGCCnCCAGGCCATCTACCTGGCCCTCCAGGACTCTGGCCTGGAGGTGAACA77GTGACTGCCTCCCAGTATGC 
LysThrAloLeuGlnAloIlcTyrLeuAloLeiClnAspSerGlyLeuGluVolAsnlleVolThrAjoSerGlnTyrAI 

480 490 500 

CCTGGGCATCATCCAGCCCCADCCTGATCAGTCTGACTCTGACCTCGTGAAttAGATCATTGAGCAGCTW^ 
oLeuGlylielleGlnAloGlnProAspGlnSerGIUSerGluLeuVolAsnGlniielieGiiClnLeuiieLyslysG 

510 520 530 

ACAACGTGTACCTGCCCT{»7CCCTG(«:ACmCCATTGGC«BCAA76AG^ 
luLysVolTyrLeuAlDTrpVolProAloHisLysGlylleGlyGlyAsnGluGlnVolAspLysLeuVoiSerAlcGly 

540 550 

A7CAGGAAGG7GC7G77CCTGBA7CGCA77CACAAGCCCmA7GAGCA7GAGAAG7ACCACT 
1 leArgLysVolLeuPbeLeuAspGlyl leAsplysAl 0Gl.nAspGluHisGluLys7yrHisSerAsn7rpAr9AlcMe 
560 570 580 
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GGCCTCTG/^TlCAACCTGCCX:CCTGTaTGGCTAAGGAGATTGTGGCClCtTGTGACAACTGCCADCTGM^ 
t A I oSer AspPheAsnl euProPr oVol Vo I A I oLysG i u 1 1 eVo I A I oSerCysAspLysCysC I nL euLysG I yG I uA 

590 600 6)0 

CCATGCATGGGCAGGTGGACTGCia^TGCCATCTGGCAGCTGGCCTGCACCCAXTGGACGGCAAGGTGATC^ 
IcWelHisClyGlnVolAspCysSerProClylleTrpGlnLeuAloCysTbrHisLci/SliXJIyLysVollleLeuVol 

" " 620" "~ 630 

GCTGTGCATGTGGCCTOaCTACATTGAGGCTGADCTGAmTGCTGAGACAHKXAW 
Al dVo IH i sVo I A I oSerG I y Ty r 1 1 eG I uAI oG I uVo I ! I eProA I fiG I uThrG I yG 1 nD I uThr A I oTyrPheLeuLe 
640 6M 660 

GAAGClGGCTGGCAGGTGGCCTGTGAAGACCATCCADCTGCC^TGGCTCCAACllCACTOGGrc 
uLvsLeuAldSlyArgTrpProVolLysThrlleHisThrAloAsnGlySerAsnPheThrGlyAloThrVolArgAIoA 

670 680 690 

CCTGCTGGTGGCCTCCCATCVy^GCAGGAGnTGGCAlCCCCTACAACCCCCAClCCCAGGGCGTGGlGGa^ 
loCysTrpTrpAloGlylleLysGlnGluPheGlylleProTyrAsnProGlnSerGlnGlyVolVolAloSerMelAsn 

700 710 

AAGGAGCTGAAGAAGA1CATTGGGCAGGTGAGGGACCACCCT6AGCACC7GAAGACAGC7GTGCAGA7GGCTGTG7TCAT 
LysGluLeuLyslysllelleGlyGlnVotArgAspGlnAloGluHisLeuLysThrAloVolGlnUelAloVolPhell 
720 . 730 .:. 740 

CCACAACTTCAAGAGGAAGGGGGGaTOGGGGGCTACTCCGCTGGGGAGAGGATTGTGGACATCATTGCCACAGA^^ 
eH i sAsnPheLysAr gL ysG I yG 1 y 1 1 eG I yG I yTyrSer Al oG lyG I uAr g 1 1 eVo lAsp 1 1 e 1 1 eAI oThrAsp II eG 

750 760 T'O 

AGACCAAGGAGCTCCAGAAGCAGATCACCAAGATCCAGAACTTCADGGTGTACTACAGGGACTCCAKAAO^ 
InThrlysG luLeuC InLysG Inl leThrLysl leG InAsnPhcArgVolTyrTyrArgAspSerArgAsnProLeuTrp 

' ' 780 790 

AAGGGCCCTCCCAAGCTGClCT(»AAGGaXAGGCGGCTGTGGTGATCCAGGACAACTCTGACATCAAGGTGGTG^ 
lysGlyProAlolysLeuleuTrpLysClyGliClyAloVoiyolUeGlnAspAsnSerAsplleLysVolVolProAr 

800 BID 820 

GABGAAOJimATCATCAGGGACTATGGCMJCADATGDCTDGGGATGACTGIGm 
QArgLysAldysIlelleAroAspTyrClylysGlnl^lAloGlyAspAspCysYqlAloSerArgplnAspGIute^ 

' B30 B40 850 



AAACCCCGGGCAGATCI (SEQ id no.; 3) 

Xx W (SEQ n> NO: 4) 
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WT 
OPT 

WT 
OPT 
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OPT 
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WT 
OPT 

WT 
OPT 

WT 
OPT 



AT6 GET SsC AAS TGG TCA AAA CGT AGT CTG CCT GGA TGG TCT 

I ! )l ill ill III II II I III 11 II III II 
AT6 GOC GGC AAG TGG TCC AAG AGG TCCGTG CCC GGC TGG TCC 

HGGKWSKRSVPGWS 

ACT GTA AGG' GAA A5A ATG AGA CGA BCT GAG CCA GCA GCA GAT 

II II III II II III II I II III II II II, 11 
ADC GTG AGG GAG AGG ATG AGG AGG GCC GAG CCC GCC GCC GAC 

TV R ERHRRAEPAAD 
AGG GTG AGA CGA ACT &AG CCA GCA GCA GTA GGG GTG GGA GCA 

III 1 1 II I II III II II II II II mil n 

ASG GTG AGG AGG ACC GAG CCC GCC GCC GTG GGC GTG GGC GCC 
RVRRTEPAAVGVGA 

GTA TCT CGA GAC CTG GAA AAA CAT GGA GCA ATC ACA AGT AGO 
I 



. ,, I III ill II 11 II II 11 111 n _ I 

GTG TCC AGS GAC CTG GAG AAG CAC GGC 6CC ATC ACC TCC TCC 
V S R D L E K H G A 1 T S S 

AAT ACA GCA GCT ACC AAT GCT GAT TGT 6CC TGG CTA GAA GCA 

II II II II III II II II II III II II II II 
AAC ACC GCC GCC ACC AAC GCC GAC TGC GCC TGG CTG GAG GCC 
NTAATNADCAWLEA 



CAA GAG GAT GAG GAA GTG GGT TTT CCA GTC AGA CCT CAG GTA 

II 111 1 111 II I 1 II I II II II II 111 n 

CAG GAG GAC GAG GAG GTG GGC TTC CCC GTG AGG CCC CAG GTG 
QEDEEV GF PVRPQV 

CCT TTA AGA CCA ATG ACT TAC AAG GGA GCT GTA GAT CTT AGC 

II 11 II III II III III II II II II Ml 
CCC CTG AGG CCC ATG ACC TAC AAG GGC GCC GTG GAC CTG TCC 

PL R PMTYK GAVDLS 

CAC TTT TTA AAA GAA AAG G6G GGA CTG GAA GGG CTA ATT CAC 

III II I II II III II II III 11 II I II III 
CAC TTC CTG AAG GAG AAG GGC GGC CTG GAG GGC CTG ATC CAC 

HFLK E KG 6LE6LIH 
TCA CAG AAA AGA CAA GAT ATC OT GAT CTG TQB GTC TAC CAC 



II III II II II II I I II II 111 II II III _ 
TCC CAG AAG AGG CAG GAC ATC CTG GAC CTG TOG GTG TAC CAC 

SQKRQOIL OLWV YH 

ACA CAA GGC TAC TTC CCT GAT TGG CAG AAC TAC ACA CCA GGG 

II II III III I I II II III HI IIU llil Ur 
ACC CAG SBC TAC TTC CCC GAC TGG CAG AAC TAC ACC CCC 6BC 

TQG YFP DWQNYTPG 



-42 

•14 

-84 

-26 

-126 

•42 

•168 

-56 
•210 

■70 
•252 

•84 
•294 

•9B 
•336 

•112 
•378 

•126 
•420 

■ 

•140 



FIGURE 19A 



47/144 



wo 02/22080 PCT/USOl/28861 



OPT 

WT 
OPT 

WT 
OPT 

VfT 

OPT 

vn" 

OPT 

WT 
OPT 



CCA GSA ATC AGA TTT CCA TTG ACC TFT EGA TGG TGC TTC AAG 

II II III II II 11 II III II II III III i I III 
CCC GGC ATC AGS TTC CCC CTG ACC TTC GGC TGG TGC TTC AAG 
PGIRFPLTF6WCFK 

CTA GTA CCA GTT GAG CCA GAA AAG GTA 6AA GAG GCC AAT 6AA 

II I II II III II II III II II iii ill II II 
CTG GTG CCC GTS GAG CCC GAG AAG GTG GAG GAS GCC AAC GAG 
LVPVEPEKVEEANE 

GGA GAG AAC AAC TGC TTG TTA CAC CCT ATG AGC CAG CAT GGG 

II III III III III II I HI II III I Ml II II 

GGC GAG AAC AAC TGC CTG CTG CAC CCC ATG TCC CAG CAC GGC 
GEN NCLLHPHSQHG 

ATA GAG GAC CCG GAS AAG GAA GTG TTA GAG TSG AGE TTT GAC 

II III III II III III II III I I III III III I I II 
ATC GAG GAC CCC GAG AAG GAS GTG CTG GAG TGG AGG TTC GAC 
lEDPEKEVLEWRFD 

AGC AAG CTA GCA TTT CAT CAC GTG GCC CGA GAG CTG CAT CCG 

I III II II II II ill III III I 111 ill II II 
TCC AAG CTG GCC TTC CAC CAC GTG GCC AGG GAG CTG CAC CCC 
SKLAFHHVARELHP 



-462 

•154 
•504 

• 168 
•546 

•162 
•5BB 

-196 
•630 

-210 
•651 



GAG TAC TAC AAG GAC TGC T6A (SEQ ID N0:30) 
GAG TAC TAC AAG GAC TGC TAA (contained rithin SEQIDNO:?) 

£ Y Y K 0 C (SEQlDNO:10) -216 
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fi»/tiih 




HlV^Pc^inact (opt) 



IIRKpds^i (Pac/|klX/pacM50^CMViniiyGHpA(str.) 

7S13 it 




KanR 



BglH/phosphatase 



Bgll l(isolate po\ gene fragment) 



Ligation 

Screen for correct orientation of pol 




HIV'Pol-tnact (opt) 




CIb^ digest pre-plasmid 



Ma IIOnMIS. 

Digest with Pad and Bst1107 t (orBstZIO? I) 



T 



Bacterial Homobgous Recomb^ation (BJ51 B3) 
XL-1 blue (plasmid ampUfcaiion) 



HIV-Pot'inact (opt) 

IfiSO} 



Figure 1: Construction of pre-plasmid *^MRKAcl5por 
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Bgtll/phosphatsse 



B9t11(is&iatenef gene fragment) 



Ligation 

Screen for correct orientation of nef , 



MA 




fcr licni3IO 




Clal digest pre-plasmld 



Digest with Pad and Bst1107 I (or BstZIOT I) 



Bacterial Honrtobgous Recombination (BJ5183) 
XL*1 blue tptasmid ampfification) 



Figure 2: Construction of pre-plasmid •pMRKAdSneT. 
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MRKAdSpol MER1062 
(MRKAdS Pre-Adenovira] Vector Containing tbe lA opt pol Coding Kegion) 



1 CATCATCAAT AATATACCTT ATTTTGGATT GAAGCCAATA TGATAATGAG 
GTAGTAC5TTA TTATATGGAA TAAAACCTAA CTTCGGTTAT ACTATTACTC 

51 GGGGTGGAGT TTGTGACGTG GCGCGGGGCG TGGGAACGGG GCGGGTGACG 
CCCCACCTCA AACACTGCAC CGCCCCCCGC ACCCTTGCCC CGCCCACTGC 

101 TAGTAGTGTG GCGGAA6TGT GATGTTGCAA GTGTGGCGGA ACACATGTAA 
ATCATCACAC CGCCTTCACA CTACAACGTT CACACCGCCT TGTGrACATT 

151 GC6ACGGATG TGSCAAAAGT GACGT?TTTG GTGTGCGCCG GTGTACACAG 
CGCTGCCTAC ACCGTTTTCA CTGCAAAAAC CACACGCGGC CACATGTGTC 

201 GAAGTGACAA TTTTCGCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 
CTTCACTGTX AAAAGCGCGC CAAAATCCGC. CTACAACATC ATTTAAACCC 

251 CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAA6AGGA 
GCATTGGCTC ATTCTAAACC GGTAAAAGCG CCCTTTTGAC TTATTCTCCT 

301 AGTGAAATCT GAATAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 
TCACTTTAGA CTTATTAAAA CACAATGAGT ATCGCGCATT ATAAACAGAT 

351 GGGCCGCGGG GACTTTGACC GTTTACGTGG AGACTCGCCC AGGTGTTTTT 
CCCGGCGCCC CTGAAACTGG CAAATGCACC TCTGAGCGGG TCCACAAAAA 

4 01 CTCAGGTGTT TTCCGCGTTC CGGGTCAAAG TTGGCGTTTT ATTATTATAG 
GAGTCCACAA AAGGCGCAAG GCCCA6TTTC AACCGCAAAA TAATAATATC 

451 GCGGCCGCGA TCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT 
. CGCCGGCGCT AGGTAACGTA TGCAACATAG GTATAGTATT ATACATGTAA 



501 TATATTGGCT CATGTCCAAC ATTACCGCCA TGTTGACATT GATTATTGAC 
ATATAACCGA 6TACAGGTTG TAATGGCGGT ACAACTGTAA CTAATAACTG 

551 TAGTTATTAA TAGTAATCAA TTACGGGGTC ATTAGTTCAT AGCCCATATA 
ATCAATAAPT ATCATTAGTT AATGCCCCAG TAATCAAGTA TCGGGTATAT 

601 TGGAGTTCCG CGTTACATAA CTTACGGTAA AT6GCCCGCC TGGCTGACCG 
ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 

651 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 
GGGTTGCT6G GGGC6GG7AA CTGCAGTIAT TACT6CATAC AAGGGTATCA 

701 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGGTGGAG TATTTACGGT 
. TTGCGCTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTC ATAAATGCCA 

751 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATAtGCC AAGTACGCCC 
OTTGACGGGT GAACCGTCAT GTAGTTCACA TAGTATACGG TTCATGCG66 

801 CCTATTGACG TCAATGACGG TAAAT6GCCC GCCTGGCATT ATGCCCAGTA 
GGATAACTGC AGTTACTGCC ATTTACCGGG C6GACCGTAA TACG6GTCAT 

851 CATGACCTTA TGGGACTTTC CTACTTQGCA GTACATCTAC GTATTAGTCA 
GTACTGGAAT ACCCTGAAAG GATGAACCGT CATGTA6ATG CATAATCAGT 
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901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 



TCGCTATTAC ^ffbCTOXTG CGGTTTTQGC AGTACATCAA TGGGCd^feA 
AGCGATAATG GTACCACTAC GCCAAAACCG TCATGTAGTT ACCCQCACCT 

TAGCGGTTTG ACTCACGGGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
ATCGCCAAAC TGAGTGCCCC TAAAGGTTCA GAGGTGGGGT AACTGCAGTT 

TGGGAGTTTG -rrTrGGCACC AAAATCAACG GGACTTTCCA AAATGTCGTA 
ACCCTCAAAC AAAACCGTGG TTTrAGTTGC CCTGAAAGGT TTTACAGCAT 

ACAACTCCGC CCCATTGACG CAAATGGGCG GTAGGCGTGT ACGGTGGGAG 
TGTTGAGGCG GGGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 



GTCTATATAA GCAGAGCTCG TTTAGTGAAC CGTCAGATCG CCTGGAGACG 
CAGATATATT CGTCTCGAGC AAATCACTTG GCAGTCTAGC GGACCTCTGC 

CCATCCACGC TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC 
OGTAGGTGCG ACAAAACTGG AGGTATCTTC TGTGGCCCT6 GCTAGGTCGG 

TCCGCGGCCG GGAACGGTGC ATTGGAACGC CGATTCCCCG TGCCAAGAGT 
AGGCGCCGGC CCTTGCCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA 

GAGATCTACC ATGGCCCCCA TCTCCCCCAT TGAGACTGTG CCTGTGAAGC 
CTCTAGATGG TACCGGGGGT AGAGG5GGTA ACTCTGACAC GGACACTTCG 

TGAAGCCTGG CATGGATGGC CCCAAGGTGA AGCAGTGGCC CCTGACTGAG 
ACTTCGGACC GTACCTACCG GGGTTCCACT TCGTCACCGG GGACTGACTC • 

GAGAAGATCA AGGCCCTGGT GGAAATCTGC ACTGAGATGG AGAAGGAGGG 
CTCTTCTAGT TCCGGGACCA CCTTTAGACG TGACTCTACC TCTTCCTCCC 

CAAAATCTCC AAGATTGGCC CCGAGAACCC CTACAACACC CCTGTGTTTG 
GTTTTAGAGG TTCTAACCGG GGCTCTTGQG GAT6TTGTGG GGACACAAAC 

CCATCAAGAA GAAGGACTCC ACCAAGTGGA GGAAGCTGGT GGACTTCAGG 
GGTAGTTCTT CTTCCTGAGG TGGTTCACCT CCTTCGACCA CCTGAAGTCC 

6AGCTGAACA AGAGGACCCA GGACTTCTGG GAGGTGCAGC TGGGCATCCC 
CTCGACTTGT TCTCCTGGGT CCTGAAGACC CTCCACGTCXS ACCCGTAGGG 

CCACCCCGCT GGCCTGAAGA AGAAGAAGTC TGTGACTGTG CTGGCTGTGG 
GGTGGGGCGA CCGGACTTCT TCTTCTTCAG ACACTGACAC GACCGACACC 

GGGATGCCTA CTTCTCTGTG CCCCTGGATG AGGACTTCAG GAAGTACACT 
CCCTACGGAT GAAGAGACAC GGGGACCTAC TCCTGAAGTC CTTCATGTGA 

GCCTTCACCA TCCCCTCCAT CAACAATGAG ACCCCTGGCA TCAGGTACCA 
CGGAAGTCGT AGGGGAG6TA GTTGTTACTC T6GGGACCGT AGTCCATGGT 

GTACAATGTG GTGCCCCAGG 6CTGGAAGGG CTCCCCTGCC ATCTTCCAGT 
CATCTTACAC GACGGG6TCC CGACCTTCCC GAGGGGACGG TAGAAGGTCA 

CCTCCATGAC CAAfiATCCTG GAGCCCTTCA GGAAGCAGAA CCCTGACATT 
GGAGGTACTG GTTCTAGGAC CTCGGGAAGT CCTTC6TCTT qOGACTGTAA 

GTGATCTACC AGTACATGGC T6CCCTGTAT GTG6GCTCTG ACCTGGAGAT 
CACTAGATGG TCATGTACCG ACCGGACATA CACCCGAGAC TGGACCTCTA 
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1851 TG6GCA6CAC JMjj^CCAAGA TTGAGGA6CT GAG6CAGCAC CTGCTG^BT 
ACCCGTCGT6 TCCTGGTTCT AACTCCTCGA CTCCGTCGTG GACGAC^Sa 

1901 GGGGCCTGAC CACCCCTGAC AAGAAGCACC AGAAGGAGCC CCCCTTCCTG 
CCCCGGi^CTG G7GGG6ACT6 TTCTTCGTGG TCTTCCTC6G OGOGAAGGJ^C 

1951 TGGATGGGCT AT6AGCTGCA CCCCG?^CAAG TGGACTGTGC AGCCCATTGT 
ACCTACCCGA TUCTCGACGT GG6GCTGTTC ACCTGACACG TCGC3GTAACA 

2001 GCT6CCTGA6 MGGACTCCT GGACIYSTGAA TGACATCCA6 AAGCT66TGG 
CGACGGACTC T7CCTGAGGA CCTGACACTT ACTGTA66TC TTCQACCACC 

2051 GCAAGCTGAA C7GG6CCTCC CAAMtTACC CTGGCATCAA GGTGA66CAG 
CGTTCGACTT GACCCGGAGG GTTTAGATGG GACCGTAGTT CCACTCC6TC 

2101 CTGTGCAAGC TCCTGAGGGG CACCAAG6CC CTGACTGAG6 7GATCCCCCT 
GACACGT7CG ACGACTCCCC GTOGTTCCGG GACTGACTCC ACTAGG6GGA 

2151 GACTGAGGA6 GCTGAGCTGG AGCTGGC^GA GAACAGGGA6 ATCCT6AAGG 
CXGACTCCTC CGACTCGACC TCGACCGACT CTTGTCCCTC TAGC5ACTTCC 

2201 AOCCTGTGCA TG6GGT0TAC TAT6ACCCCT CCAAGGACCT GATTGCTGAG 
TCGGACACGT ACCCCACATG ATACTGGGGA GGTTCCT6GA CTAACGACTC 

2251 ATCCAGAAGC AGGGCCAG6G CCAGTG6ACC TACCAAATCT ACCA6GAGCC 
TAGGTCTTCG TCCCGGTCCC 6CTCACCT6G AT6GTTTAGA TGGTCCTCG6 

2301 CTTCAAGAAC CTa^GACTG GCAAGTATGC CAG6ATGAGG GGGGCCCACA 
GAAGTTCTTG GACTTCTGAC CGTTCATACG GTCCTACTCC CCCXTGGGTGT 

2351 CCAATGAT6T GAAGCAGCTG ACTSAOGCTG TGCAGAAGAT CACCACTGAG 
GGTfACTACA C'TTC<il!C'GPiC''K 

2401 TCCATTGTGA TCTGGGGCAA GACCCCCAAG TTCAAGCTGC CCATCCACSAA 
AGGTAACACT AGACCCCGTT CTGGGGGrrC AAGTTCGACG 6GTAGGTCTT 

2451 GGAGACCTGG GAGACCTGG7 6GACTGAGTA CTGGCAGGCC ACCTGGATCC 
CCTCTGGACC CTCTGGACCA CCTGACTCAT GACCGTCCGG TQGACCTAGG 

2501 CTGAGTGGGA GTTTGTGAAC ACCCCCCCCC TGGTGAAGCT GTGGTACCAG 
GACTCACCCT CAAACACTTG TGGGGGGGGG ACCACTTCGA CACCATGGTC 

2551 CTGGAGAA66 AGCCCATTGT GGGGGCTGAG ACCTTCTATG TGQCTGG6GC 
GACCTCTTCC TCGGGTAACA CCCCCGACTC TGGAAGATAC ACCGACCCCG 

2601 TGCCAACAGG GAGACCAAGC TGGGCAAGGC TGGCTATGTG ACCAACAGGG 
' ACGGTTGTCC CTCTGGTTCG ACCCGTTCCG ACCGATACAC TQGTTGTCCC 

2651 GCAGGCAGAA GGTGGTGACC CTGACT6ACA CCACCAACCA GAAGACTGCC 
CGTCCGTCTT CCACCACTGG GACTGACTGT GGTGGTTGGT CTTCTGACGG 

2701 CTCCA6GCCA TCTACCTGGC CCTCCAGGAC TCTGGCCTGG AGGTGAACAT 
6AGGTCC6GT AGA7GGACCG GGAGGTCCTG AGS^CCGGACC TCCACTTGTA 

2751 TCTGACTGCC TCCCAGTATG CCCTQGGCAT CATCCA6GCC CAGCCT6ATC 
ACACTGACGG AGGG7CATAC GGGACCCGTA GTAGGTCCGG GTCGGACTA6 
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2801 



AGTCTGAGTC 



CTQGTG 



J^ACCAGT^TCA TTGAGCAGCT GA' 




TCACiACTCAG ACTCGACCAC TTGGTCTAGT AACTCGTCGA CTAGTTOTC 

2851 GAGAAG(?TGT ACCT66CCTG 6GTGCCTGCC CACAAGGGCA TTGGGGGCAA 
CTCTTCCACA TGGACCGGAC CCACGG?^GG GTGTTCCCGT AACCCCCGTT 

29D1 T6AGCA0GTG GACAAGCTG6 TGTCTGCTGG CATCAQGAAG GTGCTGTTCC 
ACTCGTCC«iC CTGTTCGACC ACAGACGACC GTAGTCCTTC CACGACAAGG 

2951 TGGATGGCAT TGACAAGGCC CAGGATGAGC ATGAGAAGTA CCACTCCAAC 
ACCTACCGTA ACTGTTCCGG GTCCTACTCG TACTCTTCAT CGTGAGGTTG 

3001 TGGAGOGCTA TGGCCTCTGA CTTCAACCTG CCCCCTGTG6 TGGCTAAGGA 
ACCTCCCGAT ACC6GAGACT GAAGTTGGAC GGGGGACACC ACC6ATTCCT 

3051 GATTGTGGCC TCCTGTGACA AGTGCCAGCT GAAGGGGGAG GCCATGCATG 
CTAACACCGG AGGACACTGT TCAC6GTCGA CTTCCCCCTC CGGTACGTAC 

3101 GGCAGGTGGA CTGCTCCCCT GGCATCTGGC AGCTGGCCTG CACCCACCTG 
CCGTCCACCT GACGAGGGGA CCGTAGACCG TCGACCGGAC GTGGGTGGAC 

3151 GAGG6CAAGG TGATCCTGGT GGCTGTGCAT GTGGCCTCCG GCTACATTGA 
CTCCCGTTCC ACTAGGACCA CCGACACGTA CACCGGAGGC CGATGTAACT 

3201 GGCTGAGGTG ATCCCTGCTG AGACAGGCCA GGAGACTGCC TACTTCCTGC 
CCGACTCCAC TA6GGACGAC TCTGTCCGGT CCTCT6ACGG ATGAA6GAC0 . 

3251 TGAAGCTGGC TGGCAGGT6G CCTGTGAAGA CCATCCACAC TGCCAATGGC 
ACTTCGACXTG ACCGTCCACC GGACACTTCT GGTAGGTGTG ACGGTTACCG 

3301 TCCAACTTCA CTGGGGCCAC AGTGAGGGCT . GCCTGCT6GT GGGCTGGCAT 
AGGTTGAACT GACCCCCGTG TCACTCCCGA CGGACGACCA CCCGACCGTA 

3351 CAAGCAGGAG TTTGGCATCC CCTACAACCC CCAGTCCCAG GGGGTGGTGG 
GTTC6TCCTC AAACCGTAGG GGATGTTGGG GGTCAGGGTC CCCCACCACC 

3401 CCTCCAK3AA CAAGGAGCTG AA6AAGATCA TTGQGCAGGT GAGGGACCAG 
GGAGGTACTT GTTCCTCGAC TTCTTCTAGT AACCCGTCCA CTCCCTGGTC 

3451 GCTGAGCACC TGAAGACAGC TGTGCAGATG GCTGTGTTCA TCCACAACTT 
CGACTCGTGG ACTTCTGTC6 ACACGTCTAC CGACACAAGT AGGTGTTCAA 

3501 CAAGAGGAAG GGGGGCATCG 6GGGCTACTC CGCTGGGGAG AGGATT6TGG 
GTTCTCCTTC CCCCCGTAGC CCCCGATGAG GCGACCCCTC TCCTAACACC 

3551 ACATCATTGC CACAGACATC CAGACCAAGG AGCTCCA6AA GCAGATCACC 
TGTAGTAACG GTGTCTGTAG GTCTGGTTCC TCGAGGTCTT CGTCTAGTGG 

3601 AAGATCCAGA ACTTCAGGGT 6TACTACAGG GACTCCAGGA ACCCCCTGTG 
TTCTAGGTCT TGAAGTCCCA CATGaTGTCC CTGAGGTCCT TGGGGGACAC 

3651 GAAGGGCCCT GCCAAGCTGC TGTGGAA6GG GQAGGGGGCT GTGGTGATCC 
CTTCCCGGGA CGGTTCGACG ACACCTTCCC CCTCCCCCGA- CACCACTAGG 

3701 AGGACAACTC TGACATCAAG GTGGTGCCCA GGAGGAAGGC CAA(3ATCATC 
TCCTGTTGAG ACTGTAGTTC CACCACGGGT CCTCCTTCCG GTTCTAGTAG 
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3751 AGGGACTATO Q|fclGCAGAT GGCTGGGGAT GACTGTGTGG CCltChmCA 
TCCCTGATAC CTWCGTCTA CCGACCCCTA CTGACACACC GGAGGoWgT 

3801 GGATGAGGAC TAAAGCCCGG GCAGATCTGC TGTGCCTTCT AGTTGCCAGC 
CCTACTCCTG AOTTCGGGCC CGTCTAGACG ACACGGAAGA TCAACGGTCG 

3851 CATCTGTTGT TTGCCCCTCC CCCGTGCCrT CCTTGACCCT GGAAGGTGCC 
GTAGACAACA AAC6666A6G GGGCAC6GAA 6GAACTG6GA CCTTCCACGG 

3901 ACTCCCACTG TCCTTTCCTA ATAAAATGAG GAAATTGCAT CQCATTGTCT 
TGAG6GTGAC AGGAAAGGAT TATTTTACTC CTTTAACGTA GCGTAACAGA 

3951 GAGTAGGTGT CATTCTATTC TGGGGGGTGG GGTGGGGCAG GACAGCAAGG 
CTCATCCACA GTAAGATAAG ACCCCCCACC CCACCCCGTC CTGTCGTTCC 

4001 GGGAGGATTG GGAAGACAAT AGCAGGCATG CTGGGGATGC GGTGGGCTCT 
CCCTCCXAAC CCTTCTGTTA TCGTCCGXAC GACCCCTACG CCACCCGAGA 

4051 ATGGCCGATC GGCGCGCCGT ACT6AAATGT GTGGGC6TGG CTTAAGGGTG 
TACCGGCTAG CCOCGCGGCA T6ACTTTACA CACCCGCACC 6AATTCCCAC 

4101 GGAAAGAATA TATAAGGTGG GGGTCTTATG TAGTTTTGTA TCTGTTTTGC 
CCTTTCTTAT ATATTCCACC CCCAGAATAC ATCAAAACAT AGACAAAACG 

4151 AGCAGCCGCC GCCGCCATGA -GCACCAACTC GTTTGATGGA AGCATTGTGA 
TCGTCGGCGG CGGCGGTACT CGTGGTTGAG CAAACTACCT TCGTAACACr? 

4201 GCTCATATTT GACAACGCGC ATGCCCCXAT GGGCCGGGGT GCGTCAGAAT 
.CGAGTATAAA CTGTTGCGCG TACGGGGGTA CCCGGCCCCA CGCAGTCTTA 

4251 6TGATGGGCT CCAGCATTGA TGGTCGCCCC GTCCTGCCCG CAAACTCTAC 

CACTACCCGA- GGTC6TAACT -ACCAGCGGGG CAGGACGGGC GTTTGAGATG 

4301 TACCTTGACC TACGAGACCG TGTCTGGAAC GCCGTTGGAG ACTGCAGCCT 
ATGGAACTGG ATGCTCTGGC ACAGACCTTG CGGCAACCTC TCACGTCGGA 

4351 CCGCCGCCGC TTCAGCCGCT GCAGCCACCG CCCGCGGGAT TGTGACTGAC 
GGCGGCGGCG AAGTCGGCGA CGTCG3TGGC GGGC6CCCTA ACACTGACTG 

4401 TTTGCTTTCC TGAGCCCGCT TGCAAACAGT GCAGCTTCCC GTTCATCCGC 
AAACGAAAGG ACTCGGGCGA ACGTTTGTCA CGTCGAAGGG CAAGTAGGC6 

4451 CCGCGATGAC AAGTTGACGG CTCTTTTGGC ACAATTGGAT TCTTTGACCC 
G6CGCTACTG TTCAACTGCC GAGAAAACCG TGTTAACCTA AGAAACTGGG 

4501 GGGAACTTAA TGTCGTTTCT CAGCAGCTGT TGGATCTGCG CCAGCAGGTT 
CCCTTGAATT ACAGCAAAGA GTCGTCGACA ACCTAGACGC GGTCGTCCAA 

4551 TCTGCCCTGA AGGCTTCCTC CCCTCCCAAT GCGGTTTAAA ACATAAATAA 
AGACGGGACT TCCGAAGGA6 GGGAGGGTTA CGCCAAATTT TGTATTTATT 

4601 AAAACCAGAG TCTGTTTGGA TTTGGATCAA GCAAGTGTCT TGCTGTCTTT 
TTTTGGTCT6 AGACAAACCT AAACCTAGT7 CGTTCACAGA ACGACAGAAA 

4651 ATTTAGGGGT TTTGCGCGCG CGGTAGGCCC GGGACCAGCG GTCTCGGTCG 
TAAATCCCCA AAACGCGCGC GCCATCCGGG CCCTGGTCGC CA6AGCCAGC 
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4701 TTGAGGOTCC 
AACTCCCAGG 



ITATTTT TTCCAGGACG TGGTAAAGe^MSiS* 
ATAAAA AAGGTCCTGC ACCATTTCCA CO^ 




4751 GTTCAGATAC ATGGGCATAA GCCCGTCTCT GGGGTGGAGG TAGCACCACT 
CAAGTCTATG TACCCGTATT CGGGCAGAGA CCCCACCTCC ATCGTGGTGA 

4801 GCAGAGCTTC ATGCTGC6GG GTGGTGTTGT AGATGATCCA GTCGTAGCAG 
CGTCTCGAAG TACGACGCCC CACCACAACA TCTACTA6GT CAGCATCGTC 

4851 GACCGCTGGG CGTGGTGCCT AAAAAJTGTCT TTCAGTAGCA AGCTGATTGC 
CTCGCGACCC QCACCAC6GA TTTTTACAGA AAGTCATCGT TCGACTAACG 

4901 CAGGGGCAGG CXrCTTGGTGT AAGTGTTTAC AAA6CGGTTA AGCTGGGATG 
GTCCCCGTCC GGGAACCACA TTCACAAATG TTTCGCCAAT TCGACCCTAC 

4951 GGTGCATACG TG6GGATATG AGAT6CATCT TGGACTGTAT TTTTAGGTTG 
CCACGTATGC ACCCCTATAC TCTACGTAGA ACCTGACATA AAAATCCAAC 

5001 GCTATGTTCC CAOCCATATC CCTCCGGGGA TTCATGTTGT GCAGAACCAC 
CGATACAAGG GTCGGTATAG GGAGGCCCCT AAGTACAACA CGTCTTGGTG 

5051 CAGCACAGTG TATCCGGTGC ACTTGGGAAA TTTGTCATGT AGCTTAGAAG 
GTCCTGTCAC ATAGGCCACG TGAACCCTTT AAACAGTACA TCGAATCTTC 

5101 GAAATGCGTG GAA6AACTTG GAGACGCCCT TGTGACCTCC AAGATTTTCC 
CTTTACGCAC CTTCTTGAAC CTCTGCGGGA • ACACTGGAGG TTCTAAAAGG 

5151 ATGCATTCGT CCATAATGAT GGCAATGGGC CCACGGGCGG CGGCCTGGGC 
TACGTAAGCA GGTATTACTA CCGTTACCCG GGTGCCCGCC GCCGGACCCG 

5201 GAAGATATTT CTGGGATCAC TAACGTCATA GTTGTGTTCC AGGATGAGAT 
CTTCTATAAA GACCCTAGTG ATTCCACTAT CAACACAAGG TCCTACTCTA 

5251 CGTCATAGGC CATTTTTACA AAGCGCGGGC GGAGGGTGCC AGACTGCGGT 
GCAGTATCCG GTAAAAATGT TTCGCGCCCG CCTCCCAC6G TCTGACGCCA 

5301 ATAATGGTTC CATCCGGCCC AGGGGCGTAG TTACCCTCAC AGATTTGCAT 
TATTACCAAG GTAGGCCGGG TCCCCGCATC AATGGGAGTG TCTAAACGTA 

5351 TTCCCACGCT TTGAGTTCAG ATGGGGGGAT CATGTCTACC TGCGGGGCGA 
AAGGGTGCGA AACTCAAGTC TACCCCCCTA GTACAGATGG ACGCCCCGCT 

5401 TGAAGAAAAC GGTTTCCGGG GTAGGGGAGA TCAGCTGGGA AGAAAGCAGG 
ACTTCTTTTG CCAAAGGCCC CATCCCCTCT ACTCGACCCT TCTTTCGTCC 

5451 TTCCTGAGCA GCTGCGACTT ACCGCAGCCG GTGGGCCCGT AAATCACACC 
AAGGACTCGT CGACGCTGAA TGCCGTCGGC CACCCGGGCA TTTAGTGTGG 

5501 TATTACCGGC TGCAACTGGT AGTTAAGAGA GCTGCAGCTG CCGTCATCCC 
ATAATGGCCG ACGTTGACCA TCAATTCTCT CGACGTCGAC GGCAGTAGGG 

5551 TGAGCAGGGG GGCCACTTCG TTAAGCATGT CCCTGACTCG CMGTTTTCC 
ACTC6TCCCC CCG6TGAA6C AATTCGTACA GGGACTGAGC 6TACAAAAGG 

5601 CTGACCAAAT CCGCCA6AAG GCGCTCGCCG CCCAGCGATA GCAGTTCTT6 
GACTCGTTTA GGCGGTCTTC CGCGAGCGGC G6GTCGCTAT CGTCAAGAAC 
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5651 CAAGGAAGCA AJPfTTTTCA ACGGTTTGAG ACCGTCCGCC GTAGGCl^C 
GTTCCTTCGT MCAAAAAGT TGCCAAACTC TGGCAGGCGG CATCCGTACG 

5701 TTTTGAGCGT TTGACCAAGC AGTTCCAGdC GGTCCCACAG CTCGGTCACC 
. AMUICTCGCA AACT6GTTCG TCAAGGTCCG CCAGGGTG7C GAGCCAGTGG 

5751 TGCTCTACGG CATCTCGATC CAGCATATCT CCTCGTTTCG CGGGTTGGGG 
ACGAGATGCC GTA6AGCTAG GTCGTATAGA GGAGCAAAGC GCCCAACCCC 

58D1 CGGCTTTCGC TGTACGGCAG TAGTCGGTGC TCGTCCAGAC GGGCCA6GGT 
GCCGAAAGCG ACATGCXX3TC ATCAGCCAC6 A6CAGGTCTG CCC6GTCCCA 

5851 CATGTCTTTC CACGGGCGCA GGGTCCTCGT CAGCGTAGTC TGGGTCACGG 
GTACAGAAAG GTGCCCGCGT CXX:AGGAGCA GTCGCATCAG ACCCAGT6CC 

5901 TGAAGGGGTG CX3CTCCGGGC TGCGCGCTGG CCAGGGTGCG CTTGA6GCTG 
ACTTCCCCAC GCGAGGCCCG ACGCGCGACC GGTCCCACGC GAACTCCGAC 

5951 GTCCTGCTGG TGCTGAAGCG CTGCCGGTCT TCGCCCTGCG CGTCGGCCAG 
CAGGACGACC ACGACTTCGC GACGGCCAGA AGCGGGAC6C GCAGCCGGTC 



6001 GTAGCATTTG ACCATGGTGT CATAGTCCAG CCCCTCCGCG GCGTGGCCCT 
CATCGTAAAC TGGTACCACA GTATCAGGTC GGGGAGGCGC CGCACCGGGA 

6051 TGGCGCGCAG CTTGCCCTTG GAGGAGGCGC CGCACGAGGG GCAGTGCAGA 
ACCGCGC6TC GAACGGGAAC CTCCTCCGCG GCGTGCTCCC CGTCACGTCT 

6101 CTTTTGAGGG CGTAGAGCTT GGGCGCGAGA AATACCGATT CCGGGGAGTA 
6AAAACTCCC GCATCTCGAA CCCGCGCTCT TTATGGCTAA GGCCCCTCAT 

6151 GGCATCCGCG CCGCAGGCCC CGCAGACGGT CTCGqATTCC ACGAGCCAGG 
CCGTAGGCGC GGCGTCCGGG GCGTCTGCCA GAGCGTAAGG TGCTCGGTCC 

6201 TGAGCTCTGG CCGTTCGGGG TCAAAAACCA GGTTTCCCCC ATGCTTTTTG 
ACTCGAGACC GGCAAGCCCC AGTTTTTG6T CCAAAGGGGG TACGAAAAAC 

6251 ATGCGTTTCT TACCTCTGGT TTCCATGAGC CGGTGTCCAC GCTCGGTGAC 
TACGCAAAGA ATGGAGACGA AAGGTACTCG GCCACAGGTG CGAGCCACTG 

63 Ol'' GAAAAGGCTG TCCGTGTCCC CGTATACAGA CTTGAGAGGC CTGTCCTCGA 
CTTTTCCGAC AGGCACAGGG GCATATGTCT GAACTCTCCG GACAGGAGCT 

6351 GCGGTGTTCC GCGGTCCTCC TCGTATAGAA ACTCGGACCA CTCTGAGACA 
CGCCACAAGG CGCCAGGAGG AGCATATCTT TCAGCCTGGT GAGACTCTGT 

6401 AAGGCTCGCG TCCAGGCCAG CAC6AAG6A6 GCTAAGTGGG AGGGGTAGCG 
rrCCGAGCGC A66TCCGGTC GTGCTTCCTC CGATTCACCC TCCCCATCGC 

6451 GTCGTTGTCC ACTAGGGGGT CCACTCGCTC CAGGGTGTGA AGACACATGT 
CAGCAACAGG TGATCCCCCA GGTGAGCGAG GTCCCACACT TCTGTGTACA 

6501 CGCCCTCTTC GGCATCAAGG AAGGTGATTG GTTTGTAGGT GTAGGCCACG 
GCGGGAGAAG CCGTAGTTCC 7TCCAC7AAC CAAACATCCA CATCCGGTGC 

6551 TGACCGGGTG TTCCTGAAGG GGGGCTATAA AAGGGGGTGG GGGCGCGTTC 
ACTGCCCCAC AAGGACTTCC CCCXGATATT TTCCCCCACC CCCGCGCAA6 
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6601 GTCCTCACTC TCTTCCGCAT CGCTCTCTGC GACCGCCJCSa^TSOmai^arG 
CA6GAGTGAG ^^iGGCGTA GCGACAGACG CTCCCGGTCG ACAAC(^p.C 

6651 AGTACTCCCT CTGAAAAGCG GGCATGACTT CTGCGCTAAG ATTGTCAGTT 
TCATGAGGGA GACTTTTCGC CCGTACTGAA GACGCGATTC TAACAGTCAA 

5701 TCCAAAAACG AGGAGGATTT GATATTCACC TGGCCCGCGG TGATGCCTTT 
AGGTTTTTGC TCCTCCTAAA CTATAAGTGG ACCGGGCGCC ACTACGGAAA 

6751 GAGGGTGGCC GCATCCATCT GGTCAGAAAA GACAATCTTT TTGTTGTCAA 
CTCCCACCGG CGTAGGTAGA CCAGTCTTTT CTGTTACAAA AACAACAGTT 

6801 GCTTGGTGGC AAACGACCCG TAGAGGGCGT TGGACAGCAA CTTGGCGATG 
CGAACCACCG TTTGCTGGGC ATCTCCCGCA ACCTGTCGTT GAACCGCTAC 

6B51 GAGCGCAGGG TTTGGTTTTT GTCGCGATCG GCGCGCTCCT TGGCCGCGAT 
CTCGCGTCCC AAACCAAAAA CAGCGCTAGC CCCGCGAGGA ACCGGCGCTA 

6901 GTTTAGCTGC ACGTATTCGC GCGCAACGCA CCGCCATTCG GGAAAGACGG 
CAAATCGACG TGCATAAGCG CGCGTTGCGT GGCGGTAAGC CCTTTCTGCC 

6951 TGGTGCOCTC GTCGGGCACC AGGTGCACGC GCCAACCGCG GTTGTGCAGG 
ACCAG6C6AG CAGCCCGTGG TCCACGTGCG CGGTTGGCGC CAACACGTCC 

7001 GTGACAAGGT CAACGCTGGT GGCTACCTCT CCGCGTAGGC GCTCGTTGGT 
CACTGTTCCA GTTGCGACCA CCGATG6AGA GGCGCATCCG CGAGCAACCA 

7051 CCAGCAGAGG CGGCCGCCCT TGCGCGAGCA GAATGGCGGT AGGGGGTCTA 
G6TCGTCTCC GCCGGCGGGA ACGCGCTCGT CTTACCGCCA TCCCCCAGAT 

7101 GCTGCGTCTC GTCCGGGGGG TCTGCGTCCA CGGTAAAGAC CCCGGGCAGC 
CGACGCAGAG CAGGCCCCCC AGACGCAGGT GCCATTTCTG GGGCCCGTCG 

7151 AGGCGCGCGT CGAAGTAGTC TATCTTGCAT CCTTGCAAGT CTAGCGCCTG 
TCCGCGCGCA GCTTCATCAG ATAGAACGTA GGAACGTTCA GATCGCGGAC 

7201 CTGCCATGCG CGGGCGGCAA GCGCGCGCTC GTATGGGTTG AGTGGGGGAC 
GACGGTACGC 3CCCGCC6TT CGCGCGCGAG CATACCCAAC TCACCCCCTG 

7251 CCCATGGCAT GGGGTGGGTG AGCGCGGAGG CGTACATGCC GCAAATGTCG 
GGGTACCGTA CCCCACCCAC TCGCGCCTCC GCATGTACGG CGTTTACAGC 

7301 TAAACGTAGA GGGGCTCTCT GAGTATTCCA AGATATGTAG GGTAGCATCT 
ATTTGCATCT CTCCGAGAGA CTCATAAGGT TCTATACATC CCATCGTAGA 

7351 TCCACCGCGG ATGCTGGCGC GCACGTAATC GTATAGTTCG TGCGAGGGAG 
AG6TGCCGCC TACGACCGCG CGTGCATTAG CATATCAAGC ACGCTCCCTC 

7401 CGAGGAGGTC GGGACCGAGG TTGCTACG6G CGGGCTGCTC TGCTCCCAAG 
GCTCCTCCAG CCCTGGCTCC AACGATGCCC GCCCGACGAG ACGAGCCTTC 

7451 ACTATCTGCC TCAAGATGGC ATGTGAGTTG GATGATATGG TTGGACGCTG 
TGATAGACGG ACTTC11ACC6 TACACTCAAC CTACTATACC AACCTCCGAC 

7501 GAAGACGTTG AAGCTGGCGT CTGTGAGACC TACCGCGTCA CGCACGAAGG 
CTTCTCCAAC TTCGACCGCA CACACTCTGG ATGGCGCAGT QCGTGCTTCC 
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7551 



AGGCGTAGGA 



G^CGCAGC 



TTGTTGAC'CA 



GCTCGGCGGT GACCTG 




TCCGCATCCT CAGCGCGTCG AACAACTGGT CGAOCCGCCA CTGGACGTCC 

7601 TCTAGGGCGC AGTAGTCCAG GGTTTCCTTG ATGATGTCAT ACTTATCCTG 
AGATCCCGCG TCATCAGGTC CCAAAGGAAC TACTACAGTA TCXATAGGAC 

7651 TCCCTTTTTT TTCCACAGCT CGCGGTTGAG GACAAACTCT TCGCGGTCTT 
AGGGAAAAAA AAGGTGTCGA GCGCCAACTC CTGTTTGAGA AGCGCCAGAA 

7701 TCCAGTACTC TTGGATCGGA AACCCGTCGG CCTCCGAACG GTAAGAGCCT 
AGGTCATGAG AACCTAGCCT TTGGGCAGCC GGAGGCTTGC CATTCTCGGA 

7751 AGCATGTAGA ACTGGTTGAC GGCCTGGTAG GCGCAGCATC -CCTTTTCTAC 
TCGTACATCT TGACCAACTG CCGGACCATC CGCGTCGTAG GGAAAAGATG 

7B01 GGGTAGCGCG TATGCCTGCG CGGCCTTCCG GAGCGAGGTC T6CGTGAGCG 
CCCATCGCGC ATACGGACGC GCCGGAAGGC CTCGCTCCAC ACCCACTCOC 

7B51 CAAAGGTGTC CCTGACCATG ACTTTGAGGT ACTGGTATTT GAAGTCAGT6 
6TTTCCACAG GGACTGGTAC TGAAACTCCA TGACCATAAA CTTCA6TCAC 

7501 TCGTCGCATC CGCCCTGCTC CCAGAGCAAA AAGTCCGT6C GCTTTTTGGA 
AGCAGCGTAG GCGGGACGAG GGTCTCGTTT TTCAGGCACG CGAAAAACCT 

7951 ACGCGGATTT GGCAGGGCGA AGGTGACATC GTTGAAGAGT ATCTTTCCCG 
TGCGCCTAAA CCG7CCCGCT TCCACTGTAG CAACTTCTCA TAGAAAGGGC 

8001 CGCGAGSCAT ATAGTTGCGT GT^TGCGGA AGGGTCCCGG CACCTCGGAA 
GCGCTCCGTA TTTCAACGCA CACTACGCCT TCCCAGGGCC GTGGAGCCTT 

8051 CGGTTGTTAA TTACCTGGGC GGCGAGCACG ATCTCGTCAA AGCCGTTGAT 
GCCAACAATT AATGGACCCG CCGCTCGTGC TAGAGCAGTT TCGGCAA^CTA 

8101 GTTGTGGCCC ACAATGTAAA GTTCCAAGAA OCGCGGGATG CCCTTGATGG 
CAACACCGGG TGTTACATTT CAAGGTTCTT CGCGCCCTAC GGGAACTACC 

8151 AA6GCAATTT TTTAAGTTCC TCGTAGGTGA GCTCTTCAGG 6GAGCTGAGC 
TTCCGTTAAA AAATTCAAGG AGCATCCACT CGAGAA6TCC CCTCGACTCG 

8201 CCGTGCTCTG AAAGGGCCCA GTCTGCAAGA TGAGGGTTGG AAGCGACGAA 
GGCACGAGAC TTTCCCGGGT CAGACGTTCT ACTCCCAACC TTCGCTGCTT 

8251 TGAGCTCCAC AGGTCACGGG CCATTAGCAT TTGCAGGTGG TCGCGAAAGG 
ACTCGAGGTG TCCAGTGCCC GGTAATCGTA AACGTCCACC AGCGCTTTCC 

8301 rcCTAAACTG GCGACCTATG GCCATTTTTT CTGGGGTGAT GCAGTAGAAG 
AGGATTTGAC CGCTGGATAC CGGTAAAAAA GACCCCACTA CGTCATCTTC 

8351 GTAAGCGGGT CTTGTTCCCA GCGGTCCCAT CCAAGGTTCG CGQCTAGGTC 
CATTCGCCCA GAACAAGGGT CGCCAGGGTA GGTTCXrAAGC 6CCGATCCAG 

8401 TCGCGCGGCA GTCACTAGAG GCTCATCTCC GCCGAACTTC ATGACCAGCA 
AGCGCGCCGT CAGTGATCTC CGAGTAtSAGG CGGCTTGAAG TACTGGTCGT 

B451 TGAAGGGCAC GAGCTGCTTC CCAAAGGCCC CCATCCAAGT ATA6GTCTCT 
ACTTCCCGTG CTCGACGAAG GGTTTCCGGG GGTAGGTTCA TATCCAGAGA 
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8501 



ACATCGTAOG 




.G^G ACGCTCGGTG CGAGGATGCG AGCC 




TGTAGCATCC ACTCTTTCTC TGCGAOCCAC CCTCCTACGC TCGGCTjSSCC 

6551 GAAGAACTGG ATCTCCCGCC ACCAATTGGA GGAGTGGCTA TTGATGTGGT 
CTTCTTGACC TAGAGGGCGG TGGTTAACCT CCTCACCGAT AACTACACCA 

8601 GAAAGTAGAA GnCCCTGCGA CGGGCCGAAC ACTCGTGCTG GCTTTTGTAA 
CTTTCATCTT CAGGGACGCT GCCCCCCTTG TGACCACGAC CGAAAACATT 

8651 AAACGTGCGC AGTACTGGCA GCGGTGCACG GGCTGTACAT CCTGCACGAG 
TTTGCACGCG TCATGACCGT CGCCACGTGC CCGACATGTA GGACGTGCTC 

8701 GTTGACCTGA CGACCGCGCA CAAGGAAGCA GA6TGQGAAT TTGAGCCCCT 
CAACTGGACT GCTGGCGCGT OTTCCTTCGT CTCACCCTTA AACTCGGGGA 

8751 CGCCTGGCGG GTTTGGCTGG TGGTCTTCTA CTTCGGCTGC TTGTCCTTGA 
GCGGACCGCC CAAACCGACC ACCAGAAGAT 6AA0CCGACG AACAGGAACT 

8801 CCGTCTGGCT GCTCGAGGGG AGTTAC6GTG GATCG6ACCA CCACGCCGCG 
GGCAGACCGA C6AGCTCCCC TCAATGCCAC CTAGCCTGGT GGTGCQGCGC 

8651 CGAGCCCAAA GTCCAGATGT CCGCGCGCGG CGGTCGGAGC TTGATGACAA 
GCTCGGGTTT CAGGTCTACA GGCGCGCGCC GCCAGCCTCG AACTACTGTT 

8901 CATCGCGCAG ATGGGAGCTG TCCATGGTCT GGAGCTCCCG CGGCGTCAGG 
GTAGCGCGTC TACCCTCGAC AGCTACCAGA CCTCGA6GGC GCCGCAGTCC 

8951 TCAGGCGGGA GCTCCTGCAG GTTTACCTCG CATAGACGGG TCAGGGCGCG 
AGTCCGCCCT CGAGGACGTC CAAATGGAGC. GTATCTGCCC AGTCCCGCGC 

9001 GGCTAGATCC AGGTGATACC TAATTTCCAG GGGCTGGTTG GTGGCGGCGT 
CCGATCTAGG 'TCCACTATGG ATTAAAGGTC CCCGACCAAC CACCGCCGCA 

9051 CGATGGCTTG CAAGAGGCCG CATCCCCGCG GCCCGACTAC GGTACCGC6C 
GCTACCGAAC 6TTCTCCGGC GTAGGGGCGC CGCGCTGATG CXATGGCGCG 

9101 GGCGGGCGGT GGGCCGCGGG GGTGTCCTTG CATGATGCAT CTAAAA6CGG 
CCGCCCGCCA CCCGGCGCCC CCACAGGAAC CTACTACGTA GATTTTCGCC 

9151 TGACGCGGGC GAGCCCCCGG AGGTAGGGGG GGCTCCCCAC CCGCCGGGAG 
ACTGCGCCCG CTCGGGGGCC TCCATCCCCC CCGAGGCCTG GGCGGCCCTC 

9201 AGGGGGCAGG GGCACGTCGG CGCCGCGCGC GGGCAGGAGC TGGTGCTGCG 
TCCCCC6TCC CCGTGCAGCC GCGGCGCGCG CCCGTCCTC6 ACCACGACGC 

9251 CGCGTAGGTT GCTGGCGAAC GCGACGACGC GGCGGTTGAT CTCCT6AATC 
GCGCATCCAA C6ACCGCTTG CGCTGCTGCG CCGCCAACTA GAGGACTTA6 

9301 TGGCGCCTCT GCGT6AAQAC GACGGGCCCG GTGAGCTTGA ACCTGAAAGA 
ACCGCGGAGA CGCACTTCTG CTGCCCGGGC CACTCGAACT TGGACTTTCT 

9351 GAGTTCGACA GAATCAATTT CGGTGTCGTT GACGGCGGCC TGGCGCAAAA 
CTCAAGCTGT CTTAGTTAAA GCCACA6CAA CTGCCGCCGG ACCGCGTTTT 

9401 TCTCCTGCAC GTCTCCTGAG TTGTCTTGAT AGGCGATCTC GGCCATGAAC 
AGAGGACGTG CAGAGGACTC AACAGAACTA TCCGCTAGAG CCGGTACTTG 
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9451 



TGCTCGATCT 




CTCCTO 



C0TCX:6GCTC 



GCTCi 




ACG^GCTAGA GAAGGAGG;iC CTCTAGAGGC GCA6GCC6AG CGAGGTGCCA 

9S01 GGCG6CGAGG TCGTTGGAAA TGCG66CCAT GAQCTGCGA6 AA6GCGTT6A 
CCGCCGCTCC AGCAACCTTT ACGCCCGGTA CTCGACGCTC TTCCGCAACT 

9551 GGCCTCCCTC GTTCCAGACG CGGCTGTAGA CCACGCCCCC TTCGGCATCG 
CC6GAGG6AG CAAGGTCTGC GCCGACATCT -0GTGCG6GG6 AAGCCGTAGC 

9601 CGGGCGCGCA TGACCACCTG CGCGAGATT6 AGCTCCACGT GCCGGGCGAA 
GCCCGCGCGT ACTGGTGGAC GCGCTCTAAC. TCGAGGTGCA CGGCCCGCTTT 

9651 GACGGCGTA6 TTTCGCAGGC GCTGAAAGAG GTAGTTGAG6 GTGGTG6C66 
CTGCCGCATC AAAGCGTCCG CGACTTTCTC CATCAACTCC CACCACCGCC 

9701 TGTGTTCTGC CACGAAGAAG TACATAACCC AGCGTCOCAA CGTGGATTCG 
ACACAAGACG GTGCTTCTTC ATGTATTOGG TCGCAGCOTT GCACCTAAGC 

9751 TTGATATCCC CXAAGGCCTC AAGGCGCTCC ATGGCCTCGT AGAAGTCCAC 
AACTATAGGG QGTTCCGGA6 TTCCGCGAGG TACCGGAGCA TCTTCAGGTG 

9801 GGCGAAGTTG AAAAACTGGG AGTTGCGCGC CGACACGGTT AACTCCTCCT 
CC6CTTCAAC TTTTTGACCC TCAACGCGCG GCTGTGCCAA TTGAGGAGGA 

9851 CCAGAAGACG GATGAGCTCG GCGACAGTGT CGCGCACCTC GCGCTCAAAG 
GGTCTTCTGC CTACTCGAGC CGCTGTCACA GCGCCTGGAG CGCGAGTTTC 

9901 GCTACAGGGG CCTCTTCTTC TTCTTCAATC TCCTCTTCCA TAAGGGCCTC 
CGATOTCCCC GGAGAAGAAG AAGAAGTTAG AGGAGAAGGT ATTCCCGGAG 

9951 CCCTOTTTCT TCTTCTGGCG GCG6T6GGGG AGGGGGGACA CGGCOGCGAC 
GGGAAGAAGA AGAAGACCGC CGCCACCCCC TCCCCCCTGT GCCGCCGCTG 

lOOOl GACGGCGCAC CGGGAGGCGG TCGACAAAGC GCTCGATCAT CTCCCCGCGG 
CTGCCGCGTG GCCCTCCGCC AGCT6TTTCG CGAGCTAGTA GAGGGGCGCC 

10051 C6ACGGCGCA TGGTCTCGGT GACGGCGCGG CCGTTCTCCC GGGGGC6CAG 
GCTGCCGCGT ACCAGAGCCA CTGCCGCGCC GGCAAGAGCG CCCCCGCGTC 

10101 TTGGAAGACG CCGCCCGTCA TGTCCCGGTT ATGGGTTCGC GGGGGGCTGC 
AACCTTCTGC GGCGGGCAGT ACAGGGCCAA TACCCAACCG CCCCCCGACG 

10151 CATGCGGCAG GGATACGGCG CTAACGATGC ATCTCAACAA TTGTTGTGTA 
GTACGCCGTC CCTA-TOCCGC GATTGCTACG TAGACTTGTT AACAACACAT 

10201 GGTACTCCGC C6CC6AGGGA CCTGAGCGAG TCCGCATCGA CCGGATCGGA 
CCATGA6GCG GC06CTCCCT G6ACTCGCTC AGGCGTAGCT GGCCTAGCCT 

10251 AAACCTCTC6 AGAAA6GCGT CTAftCCAGTC ACA6TCGCAA GGTAGGCTGA 
TTTO6AGAGC TCTTTCCGCA GATTGGTCAG TCTCAGCGTT CCATCCGACT 

10301 GCACCGTGGC GGGC6CCAGC GGGOGGCGGT OGQGGTTGTT TCTGGCGGA6 
CGTGGCACCG CCCGCCGTCG CCCGCCGCCA GCCCOlACAA AGACCGCCTC 

10351 GTGCTGCTGA 'TGATCTAATT AAAGTAGGCG GTCTTGAGAC GGCGGATGGT 
CACGACGACT ACTACATTAA TTTCATCCGC CAGAACTCTG CCGCCTACCA 
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1040X 



CGACAGAAGC 



TGTCCT 



TGGGTCCQGC 



ctgctgaa^g' 




GCTGTCTTCG otHaCAGGA ACCCAGGCCG GACGACTTAC GCGTCCIMCA 

10451 CGGCCATGCC CCAGGCTTCG TTTTGACATC GGCGCAGGTC TTTGTAGTAG 
GCCGGTACGG GiSTCCGAAGC AAAACTGTAG CCGCGTCCAG AAACATCATC 

10501 TCTTGCATGA GCCTTTCTAC CGGCACTTCT TCTTCTCCTT CCTCTTGTCC 
AGAACGTACT CGGAAA6ATG GCCGTGAAGA AGAAGAGGAA GGAGAACAGG 

10551 TGCATCTCTT GCATCTATCG CTGCGGCGGC GGCGGA6TTT GGCCGTAGGT 
ACGTAGAGAA CGTAGATAGC GACGCCGCCG CCGCCTCAAA CCGGCATCCA 

10601 GGCQCCCTCT TCCTCCCATG CGTGTGACCC CGAAGCCCCT CATCGGCTGA 
CC6CGGGAGA AOGAOGGTAC GCACACIGGG GCTTCGGGGA GTAGCCGACT 

10S51 AGCAGGGCTA G3TCGGCGAC AACGCGCTCG GCTAATATGG CCTGCTGCAC 
TCGTCCCGAT CCAGCCGCT6 TTGCGCGAGC CGATTATACC GGACGAC6TG 

. 10701 CTGCGTGAGG GTAGiWCTGGA AGTCATCCAT GTCCACAAAG CGGTGGTATG 
GACGCACTCC CATCTGACCT TCAGTAG6TA CAGGTGTTTC GCCACCATAC 

10751 CGCCCGTGTT GAT6GTGTAA GTGCAGTTGG CCATAACGGA CCAGTTAACG 
GCGGGCACAA CTACCACATO CACGTCAACC GGTATTGCCT GGTCAATTGC 

108 01 GTCTGGTGAC . CCGGCTGCGA GAGCTCGGTG TACCTGAGAC GCGAGTAAGC 
CAGACCACTG CGCCGACCCT CTCGAQCCAC ATGGACTCTG CGCTCATTCG 

10B51 CCTCGAGTCA AATACGTAGT CGTTGCAAGT CCGCACCAGG TACTGGTATC 
G6AGCTCAGT TTATGCATCA GCAACGTTCA GGCGTGGTCC ATSACCATAG 

10901 CCACCAAAAA GTGCGGCGGC GGCTGGCG6T AGAGGGGCCA GCGTAGGGTG 
GGTGGTTTTT GACGCCGCCG CCGACCGCCA TCTCCCCGGT CGCATCCCAC 

10951 GCCG66GCTC OGGGGGCGAG ATCTTCCAAC ATAAGGCGAT GATATCCGTA 
CGGCCCCGAG GCCCCCGCTC TAGAAGGTTG TATTCCGCTA CTATAGGCAT 

11001 GATGTACCTG GACATCCAGG TGATGCCGGC GGCGGTGGTG GAGGCGCGCG 
CTACATGGAC CTGTAGGTCC ACTACGGCC6 CCGCCACCAC CTCCGCGCGC 

11051 GAAAGTCGCG GACGCG6TTC CAGATGTTGC GCAGCGGCAA AAAGTGCTCC 
CTTTCAGCGC CTGCGCCAAG GTCKkCAACG CGTCGCCGTT TTTCACGAGG 

11101 ATGGTCGGGA CGC!nCTGGCC GOTCAGQCGC GCGCAATCGT TGACGCTCTA 
TACCAGCCCT GC6AGACCGG CCAGTCCGCG CGCGTTA6CA ACTGCGAGAT . 

11151 GACCGTGCAA AA6GAGAGCC TGTAAGCGGG CACTCTTCOG TGGTCTGGTG 
CTGGCACGTT TTCCTCTCGG ACATTCGCCC GTGAGAAGGC AOCAGACCAC 

11201 CSATAAATTCG CAAGGGTATC ATGGCGGACG ACCGGGGTTC 6AGCCCCGTA 
CTATTTAAGC GTTCCCATAfi TACCGCCTGC TGGCCCCAAG CTCGGGGCAT 

11251 TCCGGCCGTC CGCCGTGATC CATGdGGTTA CCGCCCGCGT GTCGAACCCA 
AGGCCGGCAG GCiSGCACTAG GTACGCCAAT GGCGGGCGCA CAGCTTGGGT 

11301 GGTGT6CCAC CTCAGACAAC 6GGGGASTCC TCCTTTTGGC TTCCTTCCAG 
CCACACGCTG CAGTCTGTTG CCCCCTCAC6 AOGAAAACCG AAGGAAGGTC 
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11351 GCGCGOCGGC '^^pGCGCTA GCTTTTTTGG CCACTGGCCG CGCGC/^fcx 
. CGCGCCGCC6 ^^CGCG^IT CGi^^kA^^CC G6TGACC6GC GCGCGtIRa 

11401 AAGCGGTTAG GCTGGAAAGC GAAAGCATTA AGTGGCTCGC TCCCTGTAGC 
TTCGCCAATC CGACCTTTC6 CTmrGTAAT TCACCGAGCG AGGGACATC6 

11451 CGGAGGGTTA TTTTCCAAGG GTTGAGTCGC GGGACCCCCG GTTCGAGTCT 
GCCTCCCAAT AAA^GGTTCC CAACTCAGCG CCCTG6GGGC CAAGCTCAGA 

11501 CGGACCGGCC GGACTGCGGC GAACGGGGGT TTGCCTCCCC GTCATGCAAG 
GCC7GGCCGG CCTGACGCCG CTTGCCCCCA AACGGA6GGG CAGTACGTTC 

11551 ACCCCGCTTG CAAATTCCTC CGGAAACA6G GACGAGCCCC TTTTTTGCTT 
TOGGGCGAAC GTTTAAGGAG GCCTTTGTCC CTGCTCGGGG AAAAAACGAA 

11601 rrCCCAGATG CATCCGGTGC T6CGGCAGAT GCGCCCCCCT CCTCAGCAGC 
AA6GG7CTAC GTAGGCCACG ACGCCGTCTA CGCG6GG6GA GGAGTCGTCG 

11651 GGCAAGAGCA AGAGCAGCGG CAGACATGCA GGGCACCCTC CCCTCCTCCT 
CCGTTCTC6T TCTCGTCGCC GTCTGTACGT CCCGTGGGAG GQGAGGAGGA 

11701 ACCGCGTCAG GAGGGGCGAC ATCCGCGGTT GACGCGGCAG CAGATGGTGA . 
TGGCGCAGTC CTCCCCGCTG TA6CCGCCAA CTGCGCCGTC GTCTACCACT 

11751 TTACGAACCC CCGCGGCGCC GGGCCCGGCA CTACCTGGAC TTGGAGGAGG 
AATGCTTGGG GGCGCCGCGG CCCGGGCCGT GATGGACCTG AACCTCCTCC 

11801 GCGAGGGCCT GGCGCGGCTA GGAGCGCCCT CTCCTGAGCG GCACCCAAGG 
CGCTCCCGGA CCGCGCCGAT CCTCGC6GGA GAGGACTCGC CGTGGGTTCC 

11851 GTGCA6CTGA AGCGTGATAC GCGTGAGGCG TACGTGCCGC GGCAGAACCT 

cacgtcgact ?cgcactatg cgcactcogc atgcacggcg cx:gtcttgga. 

11901 6TTTCGCGAC CGCGAGGGAG AGGAGCCCGA GGAGATGCGG GATCGAAAGT 

caaagcgctg gcgctccctc tcctcgggct cctctacgcc ctagctttca 

11951 tccacgcagg gcgcgagctg cggcatggcc tgaatcgcga gcggttgctg 
aggtgcgtcc cgcgctcgac gccgtaccgg acttagcgct cgccaacgac , 

12001 cgcgaggagg actttgagcc cgacgcgcga accgggatta gtcccgcgcg 

GC6CTCCTCC TGAAACTCGC GCTGCGCGCT TGCCCCTAAT CAGGGCGCGC 

12051 CGCACACGTG GCGGCCGCCG ACCTGGTAAC CGCATACGAG CAGACGGTGA 
GCGTGTGCAC CGCCGGCGGC TGGACCATTG GCGTATGCTC GTCTGCCACT 

12101 ACCAGGAGAT TAACTTTCAA AAAAGCTTTA ACAACCACGT GCGTACGCTT 
TGGTCCTCTA ATTGAAAGTT TTTTCGAAAT TGTTGGITGCA CGCATGCGAA 

12151 GTGGCGCGCG AGGAGGTGGC TATAGGACTG ATGCATCTGT GGGACTTTGT 
CACXGCGCGC "TCCTCCACCG ATATCCTGAC TACGTAC3ACA CCCTGAAACA 

12201 AAGCGCGCTG 6AGCAAAACC CAAATAGCAA GCCGCTCATG GCGCAGCTGT 
TTCGCGCGAC CTCGTTTTGG GTOTATCGTT - CGGCSAGTAC CGCGTCGACA 

12251 TCCTTATAGT GCAGCACAGC AGGGACAACG AGGCATTCAG GGATGCGCTG 
AGGAATATCA CGTCGTGTCG TCCCTGTTGC TCCGTAAGTC CCTACGCGAC 
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12301 CTAAACATAG TBpGCCCGA GGGCCGCTGG CTGCTCGATT TGATAAMfT 
GATTTGTATC aWcGGQCT CCCGGCGACC GACGAOCTAA ACTATTOTTA 

* 

12351 CCTGCAGAGC ATAGTGGTGC AG6AGCGCAG CTTGAGCCTG GCTGACAAGG 
GGACGTCTCG TATCACCAC6 TCCTCGCGTC GAACTCGGAC CGACTGTTCC 

12401 TGGCCGCCAT CAACTATTCC ATGCTTAGCC TG6GCAAGTT TTACGCCCCC 
ACCCGCGCTA GTTGATAAGG TACGAATCGG ACCCGTTCAA AATGCGGGCG 

12451 AAGATATACC ATACCCCTTA CGTTCCCATA GACAAGGAGG TAAAGATCGA 
TTCTATATGG TATGGGGAAT GCAAG6STAT CTGTTCCTCC ATTTCTAGCT 

12501 GGCGTTCTAC ATCCGCATGG CGCTGAAGGT GCTTACCTTG AGCGACX3ACC 
CCCCAAGATG TACGCGTACC GCGACTTCCA CGAATGGAAC TCGCTGCTGG . 

12551 TCCGCCTTTA TCGCAACGAG CGCATCCACA AGGCCGTGAG C6TGAGCCGG 
ACGCGCAAAT AGCGTTGCTC GCGTAGGTGT TCCGCCACTC GCACTCGGCC 

12601 CGGCGCGAGC rCAGCGACCQ CGAGCTGATG CACAGCCTGC AAAGGGCCCT 
GCCGCGCTCG AGTCGCTGOC GCTCGACTAC GTGTCGGACG TTTCCCGGGA 

12651 GGCTGGCACG GGCAGCGGCG ATAGAGAGGC CGAGTCCTAC TTT6ACGCGG 
CCGACCGTGC CCGTCGCCOC TATCTCTCCG GCTCAGGATG AAACT6CGCC 

12701 GCGCTGACCT GCGCTOGGCC CCAAGCXIGAC GCGCCCTGGA GGCAGCTGGG 
CCCGACTGGA CGCGACCCGG GGTTCGGCTG CGCGGGACCT CCGTC6ACCC 

12751 GCCGGACCTG GGCTGGCGGT GGCACCCGC6 CGCGCTGGCA ACGTCGGC66 
CGGCCTGGAC CCGACCGCCA CCGTGGGCGC GCGCGACCGT TGCAGCCGCC 

12801 CGTGGAGGAA TATGACGAGG ACGATGAOTA CGAGCCAGAG GACGGCGAGT 
GCACCTCCTT ATACTGCTCC TGCTACTCAT GCTCGGTCTC CTGCCGCTCA 

12B51 ACTAAGCGGT GATGTTTCTG ATCAGATGAT GCAAGACGCA ACGGACCCGG 
. TGATTCGCCA CTACAAAGAC TAGTCTACTA CGTTCTGCGT TGCCTGGGCC 

12901 CGGTGCGGGC GGCGCTGCAG A6CCAGCCGT CCGGCCTTAA CTCCACGGAC 
GCCACGCCCG CCGCGACCTC TCGGTCGGCA GGCCGGAATT GAGGTGCCTG 

12951 GACTGGCGCC AGGTCATCGA CCGCATCATG TC6CTGACTG CGCGCAATCC 
• CTGACCGCGG TCCAGTACCT GGCGTAGTAC AGCGACTGAC 6CGCCTTAGG 

13001 TCACGCGTTC CGGCAGCAGC CGCAG6CCAA CCGGCTCTCC GCAATTCTGG 
ACTGCGCAAG GCCGTCGTCG GCGTCCG6TT G6CCGAGAQG CGTTAAGACC 

13051 AAGCGGTGGT CCCGGCGCGC GCAAACCCCA C6CACSAGAA 6GTGCTGGCG 
TTCGCCACCA GGGCCGCGCG CGTTTGGGGT GCGTOCTCTT CCACGACC6C 

13101 ATCGTAAACG CGCTGGCCGA AAACAGGGCC ATCCOGCCCG ACGAGGCCGG 
TAGCATTTGC GC6ACCGGCT TTTGTCCCGG TA6GCCGGGC TQCTCCQGCC 

13151 CCreGTCTAC GACGCGCTGC TTCAGCGCGT GGCTCGTTAC AACAGCGGCA 
GGACCAGATG CTGCGC6ACG AA6TCGCGCA CXTGAGCAATG TTGTCGCCGT 

13201 ACGTGCAGAC CAACCTGGAC CGGCTGGTGG GGGATGTGCG CGAGGCCGTG 
TGCACGTCTG GTTGGACCTG GCCGACCACC CCCTACACGC GCTCCGGCAC 
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13251 GC6CAQCGTG >^^GCGCA GCAGCAOGGC AACCTGGGC7 CX:ATG(aH|C 
CGCGTCGCAC tcIcGCGCGT CGTCGTCCCG TTGGftCCCOA GGTACollcG 

13301 ACTAAACGCC TICCTGAGTA CACAGCCCGC CAACGT6CCG CXKSK^CAGG 
TGATTTGCGG AAGGACTCA7 GTGTC6SGCG GTTGCACGGC GCCCCTGTCC 

13351 AGGACTACAC CAACTTTGTG AGCGCACT6C GGCTAATGGT GACTGAGACA 
TCCTGXTGT6 GITGAAACAC TCGC6TGACG CC6ATTACCA CTGACTCTGT 

13401 COGCAAAGTG AGGTGTACCA GTCTGGGCCA GACTATTTTT TCCAGACCAG 
6GCGTTTCAC TCCACATGGT CAGACCXrGGT CTGATAAAAA AGGTCTGGTC 

13451 TAGACAAGGC CTOCAGACCG TAAACCTGAG CCAGGCTTTC AAAAACTTGC 
ATCTGTTCC6 GACGTCTGGC ATTTGGACTC GGTCCGAAA6 TTTTTGAACG 

.13501 AGGGGCTGT6 GGGGGTGCGG GCTCCCACAG GCGACCGCGC GACCGTGTCT 
TCCCCGACAC CTCCCACGCC CGAGGGTGTC CGCT0GCGC6 CTGGCACAGA 

13551 AGCTTGCTGA CGCCCAACTC GCGCCTGTTG CTGCTGCTAA TAGCGCCCTT 
TCGAACGACT GCG6GTTGAG CGCGGACAAC GACGACGATT ATCGCGGGAA 

13601 CACGGACA6T GGCAGCGTGT CCCGGGACAC ATACCTAGGT CACTTGCTGA 
GTGCCTGTCA CCGTCGCACA GGGCCCTGTG TATGGATCCA GTCAAC6ACT 

13651 CACTGTACCG C6AGGCCATA GGTCAGGC6C ATGTGGACGA GCATACTTTC 
GTGACATGGC GCTCCGGTAT CCAGTCCGCG TACACCTGCT CGTATGAAAG 

13701 CAGGAGATTA CAAGTGTCAG CCGCGCGCTG GGGCAG6AGG ACACGGGCAG 
GTCCTCTAAT GTTCACAGTC GGCGCXXTGAC CCCGTCCTCC TGTGCCCGTC 

13751 CCTGGAGGCA ACCCTAAACT ACCTGCTGAC CAACCGGCGG CAGAAGATCC 
GGACCTCCCT TCGGATTTGA- TGGACGACTO GTTGGCCGCC GTCTTCTAGG 

13801 CCTCGTTGCA CAGTTTAAAC AGCGAGGAGG AGCGCATTTT GCGCTACGTG 
GGAGCAACGT GTCAAATTTG TCGCTCCTCC TCGCGTAAAA CGCGATGCAC 

13851 CAGCAGAGCG TCAGCCTTAA CCT6ATGCGC GACGGGGTAA CGCCCAGCGT 
GTC6TCTCGC ACTCGGAATT GGACTACQCG CTGCCXTCATT GCGGGTCGCA 

13901 GGCGCTGGAC ATGACCGCGC 6CAACATGGA ACCGGGCATG TATGCCTCAA 
CCGCGACCTG TACTOGCGCG CGTTGCACXrT TGGCCXTGTAC ATACGGAGTT 

13951 ACCGGCC6TT TATCAACCGC CTAATGGACT ACTTGCATCG CGCGGCCGCC 
TGGCCGGCAA ATAGTTGGCG GATTACCIGA TGAACGTAGC GCGCCGGCG6 

14001 GTGAACCCCG AGXATTTCAC CAATGCCATC TTGAACCCGC ACTG6CTACC 
CACTT6GGGC TCATAAAGTG GTTACG6TAG AACTTGGGC6 TGACCGATGG 

14051 GCCCCCTGGT TTCTACAOCG GGGGATTCGA GGTQCCCGAG GGTAACGATC 
CGGGGGACCA AAGATGTGGC CCCCTAAGCT CCAC6GGCTC CCATTGCTAC 

14101 GATTCCTCTG GGACGACATA GACGACAGCG TGTTTTCCCC GCAACC6CAG 
CTAAGGAGAC CCT6CTGTAT CTGCTGITCGC ACAAAAG6GG CGTTGGCGTC 

14151 ACCCTGCTAG AGTTGCAACA GCGCGAGCAG GCAGAGGCGG CGCTGCGAAA 
TGGGACGATC TCAACGTT6T CCCGCTCGTC CGTCTCCGCC GC6ACGCTTT 
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14201 GGAAAGCTTC Mj^GGCCAA GCAGCTTCTC CGATCTAGtJCr-iSCfl^C&ljCC 
CCTTTC6AA0 qBtCCGGTT CGTCGAACAG : GCTAGATCCG CGACGC^G 

14251 CGCGGTCAGA TGCTAGTAGC CCATTTCCAA GCTTGATAGG GTCTCTTACC 
GCGCCAGTCT ACGATCATCG GGTAAA6GTT CGAACTATCC CAGAGAATGG 

14301 AGCACTCOCA CCACCC6CCC GC6CCTGCTG GGCGAGGAGG A6TACCTAAA 
ICGTGAGCGT GGTGGGCGGG CGCGGACGAC CCGCTCCTCC TCATGGATTT 

14351 CAACTCGCTG CTGCAGCCGC AGCGCGAAAA AAACCTGCCT CCGGCATTTC 
GTTGAGCGAC GACGTCGQCG TCGCGCTTTT TTTGGACGGA GGCCGTAAAG 

14401 OCAACAACGG GATAGAGAGC CTAGTOGACA AGATGAGTAG ATOGAAGACG 
6GTTGTTGCC CTATCTCTCG GATCACCTGT TCTACTCATC TACCTTCTGC 

14451 TACGCGCAGG AGCACAGGGA CGTGCCAGGC CCGC6CCCGC CCACCCGTCG 
ATGCGCGTCC TCGTGTCCCT GCACGCTCCG CGCGCGGQCG GGTGGGCAGC 

14501 TCAAAGGCAC GACCGTCAGC GGGGTCTGGT GTGGGAGGAC GATGACTCGG 
AGTTTCCGTG CTGGCAGTCG CCCCAGACCA CACCCTCCTG CTACTGAGCC 

14551 CAGACGACAG CAGCGTCCTG GATTTGGGAG GGAGTGGCAA CCCGTTTGCG 
GTCTGCTGTC GTCGCAGGAC CTAAACCCTC CCOCACCGTT GGGCAAACGC 

14601 CACCTTCGCC CCAGGCTGGG GAGAATGTTT TAAAAAAAAA AAAAGCATGA 
GTGGAAGCGG GGTCCGACCC CTCTTACAAA ATTTTTTTTT TTTTCGTACT 

14651 TGCAAAATAA AAAACTCACC AAGGCCATGG CACCGAGCGT TGGTTTTCTT 
ACGTTTTATT TTTTGAGTQG TTCCGGTACC GTGGCTCGCA ACCAAAAGAA 

14701 GTATTCCCCT TAGTATGCGG CGCGCGGCGA TGTATGAGGA AGGTCCTCCT 
CATAAGGCCA ATCATACGCC GCGCGCCGCT ACATACTCCT TCCAGGAGGA 

14151 CCCTCCTACG AGAGTGTGGT 6AGCGCGGCG CCAGTGGCGG CGGCGCTGGG 
GGGAGGATGC TCTCACACCA CTCGC6CCGC GGTCACCGCC GCCGCGACCC 

14801 TTCTCCCTTC GATGCTCCCC TGGACCCGCC GTTTGTGCCT CCGCGGTACC 
AAGAGGGAAG CTACGAGGGG ACCTGGGCGG CAAACACGGA GGCGCCATGG 

14851 TGCCGCCTAC CGGGGGGAGA AACAGCATCC GTTACTCTGA GTTGGCACCC 
ACGCCGGATG GCCCCCCTCT TTGTCGTAGG CAATGAGACT CAACCGTGGG 

14901 CTATTCGACA CCACCCGTGT GTACCTGGTG GACAACAAGT CAACGGATGT 
GATAAGCTGT GGTGGGCACA CATCSACCAC CTGTTGTTCA GTTGCCTACA 

14951 GGCATCCCTC AACTACCAGA AC6ACCACAG CAACTTTCTG ACCACGGTCA 
CCGTAGGGAC TT6ATGGTCT TGCTGGTGTC GMGAAAGAC TGGTGCCAGT 

15001 TTCAAAACAA TGACTACAOC CC6GGSGAGG CAAGCACACA GACCATCAAT 
AAGTTTTGTT ACTGATGTCG GGCCCCCTCC GTTCGTGTGT CT0GTAGTTA 

15051 CTTGACGACC GGTCGCACT6 GGGCGC3CGAC CTGAAAACCA TCCTGCATAC 
GAACTGCTGG 0CA6CGTGAC CCCGCPGCTG GACTTTTGGT AGGACGTATG 

151D1 CAACAT6CCA AATGIXSAACG AGTTCATGTT TACCAATAAG TTTAAGGCGC 
GTTGTACGGT TTACACTTGC TCAACTACAA ATGGTTATTC AAATTCCGCG 
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15151 GGGTGATGGT d^^CGCTTG CCTACTAAGG ACAATCAGGT GGAGCT^LA 
CCCACTACCA CAGCGCGAAC OGAIKSATTCC TCTTAGTCCA CCTCGACTTT 

15201 TACGAGTGGG TGCAGTTCAC 6CT6CCCGAG GGCAACTACT CCGAGACCAT 
ATGCTCACCC ACCTCAAGTG CGACGGGCTC CCGTTGATGA GGCTCTGGTA 

15251 GACCATAGAC CTTATGAACA ACGCGATCGT GGAGCACTAC TTGAAAGTGG 
CTGGTATCTG GAATACTTGT TGCGCTAGCA CCTCGTGATG AACTTTCACC 

15301 GCAGACAGAA CGGGGTTCTG GAAAGCGACA TCGOGGTAAA GTTTGACACC 
CGTCTGTCTT GCCCCAAGAC CTTTCGCTGT AGCCCCATTT CAAACTGTGG 

15351 CGCAACTTCA GACTGGGGTT TGACOCCGTC ACTGGTCTTG TCATGCCTGG 
6CGTTGAAGT CTGACCCCAA ACTGGGGCAG TGACCAGAAC AGTACGGACC 

15401 GGTATATACA AACGAAGCCT TCCATCCAGA CATCATTTTG CTGCCAGGAT 
CCATATATGT TTGCTTCCGA ACCTAGCTCT GTAGTAAAAC GACGGTCCTA 

15451 GCGGGGTGGA CTTCACCCAC AGCCGCCTGA GCAACTTGTT GGGCATCCGC 
CGCCCCACCT GAAGTGGGTG TCGGCGGACT CGTTGAACAA CCCGTAGGCG 

15501 AAGCGGCAAC CCTTCCAGGA GGGCTTTAGG ATCACCTACG ATGATCTGGA 
TTCGCCGTTG GGAAGGTCCT CCCGAAATCC TAGT6GATGC TACTAGACCT 

15551 GGGTGGTAAC ATTCCCGCAC TGTTGGATGT GGACGCCTAC CAGGCGAGCT 
CCCACCATTG TAAGGGCGTG ACAACCTACA CCTGCGGATG GTCCGCTCGA 

15601 TGAAAGATGA CACCGAACAG GGCGGGGGTG GCGCAGGCGG CAGCAACAGC 
ACTTTCTACr GTGGCTTGTC CCGCCCCCAC CGCGTCCGCC GTCGTTGTCG 

15651 AGTGGCAGCG GCGCGGAAGA GAACTCCAAC GCGGCAGCCG CC3GCAATGCA 
TCACCGTCGC CGCGCCTTCT CTTGAGGTTG CGCCGTCGGC GCCGTTACGT 

15701 GCCGGTGGAG GACATGAACG ATCATGCCAT TCGCGGCGAC ACCTTTGCCA 
CGGCCACCTC CTGTACTTGC TAGTACGGTA AGCGCCGCTG TGGAAACGGT 

15751 CACGGGCTGA GGAGAAGCGC GCTGAGGCCG AAGCAGCGGC CGAAGCTGCC 
GTGCCCGACT CCTCTTCGCG CGACTCCGGC TTCGTCGCC6 GCTTCGACGG 

15801 GCCCCCGCTG CGCAACCCGA GGTCGAGAAG CCTCAGAAGA AACCGGTGAT 
CGGGGGCGAC 6CGTTGGGCT CCAGCTCTTC GGAGTCTTCT TTGGCCACTA 

15B51 CAAACCCCTG ACAGAGGACA GCAAGAAACG CAGTTACAAC CTAATAAGCA 
GTTTGGGGAC TGTCTCCTGT CGTTCTTTGC GTCAATGTTG GATTATOCGT 

15901 ATGACAGCAC CTTCACCCAG TACCGCAGCT GGTACCTTGC ATACAACTAC 
TACTGTCGTG GAAGTGGGTC ATGGCGTCGA CCATGGAAC6 TATGTTGATG 

15951 GGCGACCCTC AGACCGGAAT CCGCTCATGG ACCCTGCTTT GCACTCCTGA 

CCGCTG6GAG TCTGGCCTTA GGCGAGTACC TGGGACGAAA CGTGAGGACT . 

160.01 CGTAACCTGC. GGCTCGGAGC A6GTCTACT6 GTCGTTGCCA GACATGATGC 
6CATTGGACG CCGAGCCrcG TCCAGATGAC CAGCAACGGT CTGTACTACG 

16051 AAGAGCCCGT GACCTTCCGC TCCACGCGCC AGATCAGCAA CTTTCCGGTG 
TTCTGGGGCA CTGGAAGGCG AGGTGCGCGG TCTAGTCGTT 6AAAGGCCAC 
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16101 GTGGGCGCCG ^^fGTTGCC CGTGCACTCC AAGAGCTTCl^CSAC^j|CA " 

• cAcccGCGGC otRcaacgg gcacgtgagg ttctcgaaqa tgttgcWIt 

16151 GGCCGTCTAC TCCCAACTCA TCCGCCAGTT TACCTCTCTG ACCCACGTGT 
CCGCCAGATG AGGGTTGAGT AGGCGGTCAA ATG6AGAGAC TG6GTGCACA 

16201 TCAATCGCTT TCCCGAGAAC CAGATTTTGG CGCGCCCGCC AGCGCCCACC 
AGTTACCGAA AGGGCTCTT6 GTCTAAAACC GCGCGGGCGG TCGGGGGTGG 

16251 ATCACCACXTG TCAGTGAAAA CGTTCCTGCT CTCACAGATC ACGGGACGCT 
TAGTGGTG6C AGTCACTTTT CCAAC6AC6A GAGTGTCTA6 TGCCCTGCQA 

16301 ACCGCTGCGC AACAGCATCG GAGGAGTCCA GCGAGTGACC ATTACTGACG 
TGGCGACGCC TTGTCGTAGC CTCCTCA6GT CGCTCACTGG TAATGACTGC 

16351 CCAGACX3CCG CACCTGCCCC TACGTTTACA AGGCCCTGGG CATAGTCTCG 
GGTCTGCGGC GTGGACGGGG ATGCAAATGT TCCGGGACCC GTATCAGAGC 

16401 CCGCGCGTCC OIATCGAGCCG CACTTTTTGA GCAAGCATGT CCATCCTTAT 
GGCGCGCAGG ATAGCTCGGC GTGAAAAACT CGTTCGTACA GGTAOGAATA 

16451 ATCGCCCAGC AATAACACAG GCTGGGGCCT GCGCTTCCCA AGCAAGATGT 
TAGCGGGTCG TTATTGTGTC CGACCCC6GA CGCGAA6GGT TCGTTCTACA 

16501 TTGGCGGGGC CAAGAAGC6C TCCGACCAAC ACCCAGTGCG CGTGCGCGGG 
AACCGCCCCC GTTCTTCGCG AGGCTGGTTG TGGGTCACGC GCACGCGCCC 

16551 CACTACCGC6 CGCCCTGGGG CGCGCACAAA CGCGGCCGCA CTGGGCGCAC 
GTGATGGCGC GCGGGACCCC GCGCX5TGTTT GCGCCGGCGT GACCCGCGTG 

16601 CACCGTCGAT GACGCCATCG ACGCGGTGGT GGAGGAGGCG CGCAACTACA 
GTGGCAGCTA CTGCGGTAGC TGCGCCACCA CCTCCTCCGC GCGTTGATGT 

16651 CGCCCACGCC GCCACCAGTG TCCACAGTGG ACGCGGCCAT TCA6ACCGTG 
GCGGGTGCGG" CGGTGGTCAC AGGT6TCACC TGCGCCGGTA AGTCTGGCAC 

16701 GTGCGCGGAG CCCGGCGCTA TGCTAAAATG AAGAGACGGC GGAGGCGCGT 
CACGCGCCrC GGGCCGCGAT ACGATTTTAC TTCTCTGCCG CCTCCGCGCA 

16751 AGCACGTCGC CACCGCCGCC GACCCGGCAC TGCCGCCCAA CGCGCGGCGG 
OtGTGCAGCG GTGGCGGCGG CTGGGCCGTG ACGGCGGGTT GCGCGCCGCC 

16801 CGGCCCTGCT TAACC6CGCA CGTC6CACCG GCCGACGGGC GGCCATGCGG 
GCCGGGACGA ATTGGCGCGT GCAGCGTGGC CGGCTGCCCG CC6GTACGCC 

16 851 6CCGCTCGAA GGCTOQCCGC GGGTATTGTC ACTGTGCCCC CCAGGTCCAG 
CGGCGAGCTT CCGACCGGCG CCCATAACAG TGACACGG6G GGTCCAGGTC 

16901 GCGACGAGCG GCCGCCGCAG CAGCCGCG6C CATTAGTGCT ATGACTCAGG 
CGCTGCTCGC CGGCG6CGTC CTCGGCGCCG 6TAATCACGA TACTCAGTCC 

16951 GTCQCAGGGG CAACGTGTAT TGGGTGCGCG ACTCGGTTAG CGGCCTGCGC 
CAGCGTCCCC GTTGCACATA ACCCACGC6C TGAGCCAATC GCCG6ACGCG 

17001 CTOCCCGTGC GCACCCGCCC CCCGCGCAAC TAGATTGCAA G AAAAAAC TA 
CACQGGCACG CGTGGGCGGG OGGCGCGTTC ATCTAACCTT CTTTTTTGAT 



72/144 



wo 02/22080 



PCT/US01/2S861 



1705X CTTAGACTCG «p6TTGTA TGTATCCAGC WCQQCWfJd UiCAAU^pii 
CAATCTGAGC A^RcAACAT ACATAGGTCG CCGCCGCCGC GCGTTG^Sc 

17101 CTATCTCCAA 6CGCAAAATC AAAGAASAGA TGCTCCAGGT CATCGCGCCG 
GATACA6GTT CGCGTTTTAG TTTCTTCTCT ACGAGGTCCA GTAGCGCGGC 

17151 GAGATCTATG GCCCCCCGAA GAAGGAAGAG CAGGATTACA AGCCCC6AAA 
CTCTAGATAC CQGGGGGCTT CTTCCTTCTC GTCCTAAT6T TCGGGGCTTT 



17201 GCTAAAGCGG 6ICAAAAAGA AAAAGAAAGA TGATGATGAT GAACTT6ACG 
CGATTTCGCC CAGTTTTTCT TTTTCTTTCT ACTACTACTA CTTGAACTGC 

17251 ACGAGGTGGA ACTGCTGCAC GCTACCGCGC CCAGGCGAC6 GGTACAGTG6 
TGCTCCACCT TGACGACGTG CGATGQCGC6 GGTCCGCT6C CCATGTCACC 

173 01 AAAOGTCGAC GCGTAAAACG TGTTTTGCGA CCCGGCACCA CCCTACTCTT 
TTTCCAGCTG CGCATTTTGC ACAAAACGCT GGGCCGTGGT GGCATCA6AA 

17351 TACGCCCGGT GAGCGCTCCA CCCGCACCTA CA^GCGCGTG TAT6ATGAG6 
AT6CGGGCCA CTCGCGAGGT GGGCGTGGAT GTTCGCGCAC ATACTACTCC 

17401 TGTACGSC6A CGA6GACCTG CTT6AGCA66 CCAAC6AGC6 CCrCGGGGAG 
ACATGCCGCT QCTCCTGGAC GAACTCGTCC GGTTGCTCGC GGAGCCCCTC 

17451 TTTGCCTACG GAAAGCGGCA TAA6GACAT6 CTGGCGTTGC CGCTGGACGA 
AAACG6ATGC CTTTCGCCGT ATTCCT6TAC GACCGCAACG GCGACCTGCT 

17501 G6GCAACCCA ACACCTAGCC TAAAGCCCGT AACACTGCAG CAGGTGCTGC 
CCCGTTGGGT TCTGGATCGG ATTTCGGGCA TTGTGACGTC GTCXACGACG 

17551 CCGCGCTtGC ACCGTCC6AA CAAAAGCSCG GCCTAAAGCG CGAGTCTGGT 
GGCGCGAACG TGGCAGGCTT CTTTTCGCGC CGGATTTCGC GCTCAGACCA 

17601 GACTTGGCAC CCACCGTGCA GCTGATGGTA CCCAAGCGCC AGCGACTGGA 
CTGAACCGTG 6GTGGCACGT CGACTACCAT GGGTTCGCGG TCGCTGACCT 

17651 AGATGTCTTG GAAAAAATGA CCGTGGAACC TGGGCTGGAG CCCGAGGTCC 
TCTACAGAAC CTTTTTTACT GGCACCTTGG ACCCGACCTC GGGCTCCAGG 

17701 GC6TGCGGCC AATCAAGCA6 GTG6C0CCGG GACTGGGCGT GCAGACCGTG 
CGCACGCC6G TTAGTTCGTC CACCGCG6CC CTGACCCGCA CGTCTGGCAC 

w 

m 

17751 GACGTTCAGA TACCCACTAC CAGTAGCACC AGTATTGCXA CCGCCACAGA 
CTGCAAGTCT ATGGGT6ATG GTCATCGTGG TCATAACGGT GGCGGTGTCT 

17801 GGGCATGGAG ACACAAACOT CCCCGGTTGC CTCAGC6GTG GCG6ATGCCG 
CCCGTACCTC TGTGTTTCCA GGGGCCAACG GAGTCGCCAC CGCCTACGGC 

17851 CGGTGCAG6C GGTCGCTGCG GCCGCGTIXA A6ACCTCTAC GGAGGTGCAA 
GCCACGTCCG CCAGCGACGC CGGCGCAOGT TCTGGAGATG CCTCCACGTT 

17901 ACGGACCCGT GGATGTTTC6 COTTTCAQCC CCCCGGCGCC CGC6CCGTTC 
TGCCTGGGCA CCTACAAAGC GCAAAGTCGG GGG0CCGCG6 GCGCGGCAAG 

17951 GAGGAAGTAC GGOGCCGCCA GCGCGCTACT GCCCGAATAT QCCCTACATC 
CTCCTTCATC CCQCGGCGGT CQCGCa^TGA CGGGCTTATA CGGGATGTAG 
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18001 CTTCCATTGC G^TACCCCC GGCTATCGTG GCTACACOrfiS^OGeC^^A 
CAAGGTAACG d^^TGGGGG CCGATAGCAC CGATGTGGAT GGCGGCS^^T 

18051 AC3ACGAGCAA CTACCCGACG CXGAACCACC ACTGGAACCC GCCGCCGCCG 
TCTGCTCGTT GATGGGCTGC GGCTTGGTGG TGACCTTGGG CGGCGCCGGC 

18101 TCGCCGTCGC CAGCCCGTGC TGGCCCCGAT TTCCGTGCGC AGGGTGGCTC 
AGCGGCAGCG GTC6GGCACG ACCGGGGCTA AAGGCACGCG TCCCACCGAG 



18151 GCGAAGGAGG CAGGACCCTG GTGCTGCCAA CAGCGCGCTA CCACCCCAGC 
CGCTTCCTCC GTCCTGGGAC CACGACGGTT GTCGCGCGAT GGTGGGGTCG 

18201 ATC6TTTAAA AGCCGGTCTT TGTGGTTCTT GCAGATATGG CCCTCACCTG 
TAGCAAATTT TCGGCCAGAA ACACCAAGAA CGTCTATACC GGGAGTGGAC 

1B251 CCGCCTCCGT TTCCCGGTGC CGGGATTCCG AGGAAGAATG CACCGTAGGA 
GGCGGAGGCA AAGGGCCACG GCCCTAAGGC TCCTTCTTAC GTGGCATCCT 

18301 GGGGCATGGC CGGCCACGGC CTGACGGGCG GCATGCGTCG TGCGCACCAC 
CCCCGTACCG GCCGGTGCCG GACTGCCCGC CGTACGCAGC ACGCGTGGTG 

1B351 CGGCGGCGGC GCGCGTCGCA CC6TCGCATG CGCGGCGGTA TCCTGCCCCT 
GCCGCCGCCG CGCGCAGCGT GGCAGCGTAC GCGCCGCCAT AGGACGGGGA 

18401 CCTTATTCCA CTGATCGCCG CGGCGATTGG CGCCGTGCCC GGAATTGCAT 
GGAATAAGGT GACTAGC6GC GCCGCTAACC GCGGCACGGG CCTTAACGTA 

18451 CCGTGGCCTT GCAGGCGCAG AGACACTGAT TAAAAACAAG TTGCATGTGG 
GGCACCGGAA CGTCCGCGTC TCTGTGACTA ATTTTTGTTC AACGTACACC 

18501 AAAAATCAAA , ATAAAAAGTC TGGACTCTCA CGCTCGCTTG GTCCTGTAAC 
TTTTTAGTTT TATTTTTCAG ACCTGA3AGT GCGAGCGAAC CAGGACATTG 

18551 TATTrTGTAG AATGGAAGAC ATCAACTTTG CGTCTCTGGC CCCGCGACAC 
ATAAAACATC TTACCTTCTG TAGTTGAAAC GCAGAGACCG GGGCGCTGTG 

18601 GGCTCGCGCC CGTTCATGGG AAACTGGCAA GATATCGGCA CCAGCAATAT 
CCGAGCGCGG GCAAGTACCC TTTGACCGTT CTATAGCCGT GGTCGTTATA 

18651 GAGCGGTGGC GCCTTCAGCT GGGGCTCGCT GTGGAGCGGC ATTAAAAATT 
CTCGCCACCG CGGAAGTCGA CCCCGAGCGA CACCTCGCCG TAATTTTTAA 

18701 TCGGTTCCAC CGTTAAGAAC TATGGCAGGA AGGCCTGGAA CAGCAGCACA 
AGCCAAGGTG GCAATTCTTG ATACCGTCGT TCC6GACCTT GTCGTCGTGT 

18751 GGCCAGATGC TGAGGGATAA GTTGAAAGAG CAAAATTTCC AACAAAAGGT 
CCGGTCTACG ACTCCCTATT CAACTTTCTC CTTTTAAA6G TTGTTTTCCA 

18801 GGTAGATGGC CTGGCCTCTG GCATTAGCGC GGTGGTGGAC CTGGCCAACC 
CCATCTACCG GACCGGAGAC CCTAATC6CC CCACCACCTG GACCGGTTGG 

■ 

18851 AGGCAGTGCA AAATAAGATT AACAGTAAGC TTGATCCCCG CCCTCCCGTA 
TCCQTCACGT TTTATTCTAA TTGTCATTCG AACTAGG6GC GGGAGGGCAT 

18901 GAGGAGCCTC CACCGGCCGT GGAGACAGTG TCTCCAGAGG GGCGTGGCGA 
CTCCTCGGAG GTGGCCGGCA CCTCTGTCAC AGAGGTCTCC CCGCACCGCT 
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18951 MJiGCGTCCG C^CCGACA GGGAAGAAAC TCTGGTGACG"CAAATA|H6 
TTTCGCAGGC G^BGGCTGT CCCTTCTTTC AGACCACTGC GTTtATSiic 

19001 AGCCTCCCTC GTACGAGGAG GCACTAAAGC AAGGCCTGCC CACCACCCGT 
TCGC5&GGGAG CATGCTCCTC CGTGATTTCG TTCCGGACGG GTGGTGGGCA 

19051 CCCATCGCGC CCATGGCTAC CG6AGTGCTG GGCCAGCACA CACCCGTAAC 
GGGTAGCGCG GGTACCGATG 6CCTCACGAC CCGGTCGTGT GTGGGCATTG 



19101 GCTGGACCTG CCTCCCCCCG CCGACACCCA GCAGAAACCT CTGCTGCCAG 
CGACCT6GAC 6BAGGG0GGC 6GCTGTGGGT CGTCTTTGGA CACGACGGTC 

19151 GCCCGACCGC C3TTGTTGTA ACCCGTOCTA GCCGCGCGTC CCTGCGCCGC 
CGGGCT6GCG 6ZAACAACAT TGGGCAGGAT COGCGCGCAG GGACGCGGCG 

19201 GCCGCCAGCG GTCCGCGATC GTTGCGGCCC GTAGCCA07G GCAACTGGCA 
C6GCGGTCGC CAGGC6CTAG CAAC6CCGGG. CATCGGTCAC CGTTGACCGT 

19251 AAGCACACTG AACAGCATCG TG6CTCTGGG GGTGCAATCC CTGAAGCGCC 
TTCGTGT6AC TTGTCGTAGC ACCCAGACCC CCACGTTAGG GACTTCGCGG 

19301 GACGATGCTT CTGATAGCTA ACGT61CGTA TGT6TGTCAT GTATGCGTCC 
CTGCTACGAA GACTATCGAT TGCACAGCAT ACACACAGTA CATACGCAGG 

19351 ATGTC6CCGC CAGAGGAGCT GCTGAGCCGC CGCGCGCCCG CTTTCCAAGA 
TACAGCGGCG GTCTCCTCGA C6ACTCGGC6 GCGCGCGGGC GAAAG6TTCT 

194 01 TGGCTACCCC TTCGATGATG CC6CAGTGGT CTTACATGCA CATCTCGGGC 
ACCGATGGGG AAGCTACTAC GGCGTCACCA GAATGTACGT GTAGAGCCCG 

19451 CAGGACGCCT CGGAGTACCT" GA6CCCCG66 CTGGTGCAGT TT5CCCGCGC 
GTCCTGCGGA GCCTCATGGA CTCGGGGCCC GACCACGTCA AACGGGCGCG 

19501 CACCGAGACG TACTTCAGCC TGAATAACAA GTTTAGAAAC CCCACGGTGG 
GT6GCTCTGC ATGAAGTCGG ACTTATTGTT CAAATCTTTG GGSTGCCACC 

19551 CGCCTACGCA CGACGTGACC ACAGACCGGT CCCAGCGTTT GACGCTGCGG 
GCGGATCCGT GCTGCACTGG TGTCTGGCCA GGGTCGCAAA CTGCGACGCC 

19601 TTCATCCCTG TGGACCGTGA GGATACTGCG TACTCGTACA A6QC6CGGTT 
AAGTAGGGAC ACCTGGCACT CCTATGACGC ATGAGCATGT TCCGCGCCAA 

19651 CACCCTAGCT GTGGGTGATA ACCGTGTGCT GGACAT6GCT TCCACGTACT 
GTGGGATCGA CACCCACTAT OK^SCACACGA CCT6TACCGA AGGTGCATGA 

19701 TTGACATCCG CGGCGTGCT6 GACAGGGGCC CTACTTTTAA GCCCTACTCT 
. AACTGTAGGC GCCOCACGAC CTGTCCCCOG GATG7AAATT CGGGATGAGA 

19751 GGCACTGCCT ACAACOCCCT GGCTCCCAAG GGTGCCCCAA A3CCTTGCGA 
CCGTGACGGA TGTTGCGGGA CCGAG6GTTC CCACG66GTT TAG6AAC6CT 

19B01 ATGGGATGAA GCTGC7ACTG CTCTTGAAAT AAACCTAGAA GAAGAGGACG 
TACCCTACTT CGAC6ATGAC CSA6AACTTTA TTTGGATCTT CTTCTCCTGC 

19851 AT6ACAACGA A6ACGAAGTA GACGAGCAAG CTGAOCAGCA AAAAACTCAC 
TACTGTTGCT TCTGCTTCAT CTGCTCGTTC GACTCGTCGT TTTTTGAGTG 
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19901 . GTATTTCCGC JS^pOCCTTA TTCTGGTATA AATATTACAA AGGAGq^AT 
CATAAACCCG T^GCXSGAAT AAGACCATAT TTATAATGTT TCCTCC^Ra 

> 

19951 TCAAATAGGT GTCGAAGGTC AAACACCTAA ATATGCCGAT AAAACATTTC 
AGTTTATCCA CACCTTCCAG TTTGTGGATT TATACGGCTA TTTTGTAAAG 

20001 AACCTGAACC TCAAATAGGA GAATCTCAGr GGTACGAAAC AGAAATTAAT 

ttggacttgg AGTTTATCCT cttagagtca ccatgctttg tctttaatta 

20051 catgcagctg ogagagtcct aaaaaagact acoccaatga aaccatgtta 
GTACGTCGAC cx:tctcagga ttttttctga tgqggttact ttggtacaat 

20101 CGGTTCATAT QCAAAACCCA CAAATGAAAA TGGAGGOCAA GGCATTCTTG 
GCCAAGTATA CCmTCGGT GTTTACTTTT ACCTCCCGTT CCGTA^lGAAC 

20151 TAAAGCAACA AAATG6AAAG CTAGAAAGTC AAGTGGAAAT GCAATTTTTC 
ATTTCGTTGT TTTACCTTTC GATCTTTCAC TTCACCTTTA CCTTAAAAAC 

20201 TCAACTACTG AGGCAGCCGC AGGCAATGGT GATAACTTGA CTCCTARAGT 
AGTTGATGAC TCCGTCGGCG TCCGTXACCA CTATT6AACT GAGGATTTCA 

20251 GGTATTGTAC AGTGAAGATG TAGATATAGA AACCCCAGAC ACTCATATTT 
CCA7AACAT6 TCACTTCTAC ATCTATATCT TTGGGGTCTG TGAGTATAAA 

20301 CTTACATGCC CACTATTAAG GAA6GTAACT CACGAGAACT AATGGGCCAA 
GAATGTACGG GTGATAATTC CTTCCATTGA GTGCTCTTGA TTACCCGGTT 

20351 CAATCTATGC CCAACAGGCC TAATTACATT GCTTTTAGGG ACAATTTTAT 
GTTAGATACG GGTTGTCCGG ATTAATGTAA CGAAAATCCC TGTTAAAATA 

20401 TGGTCTAATG TATTACAACA GCACGOGTAA TATGGGTGTT CTGGCGGGCC 
ACCAGATTAC ATAATGTTGT CGTGCCCATT ATACCCACAA GACCGCCCGG 

20451 AAGCATCGCA GTTGAATGCT GTTGTAGATT TGCAAGACAG AAACACAGAG 
TTCGTAGCGT CAACTTACGA CAACATCTAA ACGTTCTGTC TTTCTGTCTC 

20501 CTTTCATACC AGCTTTTGCT TGATTCCATT GGTGATAGAA CCAGGTACTT 
GAAAGTATGG TCGJ^AAACGA ACTAAGGTAA CCACTATCTT GGTCCATGAA 

20551 TTCTATGTGG AATCAGGCTG TTGACAGCTA TGATCCAGAT GTTAGAATTA 
AAGATACACC TTAGTCCGAC AACTGTCGAT ACTAGGTCTA CAATCTTAAT 

20601 TTGAAAATCA TGGAACT6AA GAT6AACTTP CAAATTACT6 CTTTCCACTG 
AACTTTTAGT ACCTTGACTT CTACTT6AAG GTTTAATGAC 6AAAGGTGAC 

20651 GGAGGTCTGA TTAATACAGA GACTCTTACC AAGGTAAAAC CTAAAACAGG 
CCTCCACACT AATTATGTCT CTGAGAATGO TTCCATTTTG 6ATTTTGTCC 

207 01 TCAGGAAAAT GGATGGGAAA AAGATGCTAC AGAATOTTCA GATAAAAATG 
AGTCCTTTTA CCraCCCTTT TTCTACGATG TCTTAAAAGT CTATTTTTAC 

20751 AAATAAGAGT TGGAAATAAT TTTGCCATGG AAATCAATCT AAATGCCAAC 
TTTATTCTCA ACCTTTATTA AAACGGTACC TTTAGTTAGA TTTACGGTTG 

20801 CTGTGGAGAA ATTTCCTGTA CTCCAACATA GCGCTGTATT TGCCCGACAA 
GACACCTCTT TAAAGGACAT GACGTTGTAT CGC6ACATAA ACGGGCTGTT 
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20851 GCTAAAGTAQ ;^||CCTTCCA ACGTAAAAAT rrCTGAT;acf^d;S^?itSgtT 
CGATTTCATG ^^BIgAAGGT TGCATTTTTA AAGACTATTG GGTTTd^A 

20901 ACGACTACAT GAACAAGCGA GTGGTGGCTC CCGGGCTAGT GGACTGCTAC 
TGCTGAT(3TA CTTGTTCGCT CACCACXX3A6 GGCCCGATCA CCTGACGATG 

20951 ATTAACCTTG GAGCACGCTG GTCCCrTGAC TATATGGACA ACGTCAACCC 
TAATTGGAAC CTCGTGC6AC CAGGGAACTG ATATACCTGT TGCAGTTGGG 

21001 ATTTAACCAC CACCGCAATG CTGGCCTGCG CTACCGCTCA ATGTTGCTGG 
TAAATTGGT6 GTG6CGTTAC GACCGGACGC GATGGCGAGT TACAACGACC 

21051 GCAATGGTCG CTATGTGCCC TTCCACATCC AGGTGCCTCA 6AAGTTCTTT 
CGTTACCAGC GATACACG6G AAGGTGTA6G TCCACGGAGT CTTCAAGAAA 

21101 6CCATTAAAA ACCTCCTTCT CCTGCCGGGC TCATACACCT ACGAGTGGAA 
CG6TAATTTT TGGAGGAAGA GGACGGCCCG AGTATGTGGA TGCTCACCTT 

21151 CTTCAGGAAG GATGTTAACA TGGTTCTGCA GAGCTCCCTA 6GAAATGACC 
GAAGTCCTTC CTACAATTGT ACCAAGACGT CTCGAGGGAT CCTTTACTGG 

21201 TAAGGGTTGA CGGAGCCAGC ATTAAGTTTG ATAGCATTTG CCTTTACGCC 
ATTCCCAACT GCCTCGGTCG TAATTCAAAC TATCGTAAAC GGAAATGCGG 

21251 ACCTTCTTCC CCATGGCCCA CAACACCGCC TCCACGCTTG AGGCCATGCT 
TGGAAGAAG6 GGTACCGGGT GTTGTGGCGG AGGTGCGAAC TCCGGTACGA 

21301 TAGAAACGAC ACCAACGACC AGTCCTTTAA CGACTATCTC TCCGCCGCCA 
ATCTTTGCTG TGGTTGCTGG TCAGGAAATT GCTGATAGAG AGGCGGCGGT 

21351 ACATGCTCTA CCCTATACCC GCCAACGCTA CCAACGTGCC CATATCCATC 
. TGTACGAGAT -GGGATATGGG .CGGTTGCGAT GGTTGCACGG GTATAGGTAG 

214 01 CCCTCCCGCA ACTGGGCGGC TTTCCGCGGC TGGGCCTTCA CGCGCCTTAA 
GGGAGGGCGT TGACCCGCCG AAAGGCGCCG ACCCGGAAGT GCGCGGAATT 

21451 GACTAAGGAA ACCTCATCAC TGGGCTCGGG CTACGACCCT TATTACACCT 
CTGATTCCTT TCGGGTAGTG ACCCGAGCCC GATGCTGGGA ATAATGTGGA 

21501 ACTCTGGCTC TATACCCTAC GTAGATGOAA CCTTTTACCT CAACCACACC 
TGAGACCGA6 ATATGGGATG GATCTACCTT GGAAAATGGA GTTGGTGTGG 

21551 TTTAAGAAGG TGGCCATTAC CTTTGACTCT TCTGTCAGCT GGCCTGGCAA 
AAATTCTTCC ACCGGTAATG GAAACTGAGA AGACAGTCGA CCGGACCGTT 

21601 TGACCGCCTC CTTACCCCCA ACGAGTTTGA AATTAAGCGC TCAGTTGACG 
ACTGGCGGAC GAATGGGGGT TGCTCAAACT TTAATTCGCG AGTCAACTGC 

21651 GGGAGGGTTA CAACGTTGCC CAGTGTAACA T6ACCAAACSA CTGGTTCCTG 
CCCTCCCAAT GTTGCAACGG GTCACATTGT ACTGGTTTCT GACCAAGGAC 

21701 GTACAAATGC TAGCTAACTA TAACATTGGC TACCAGGGCT TCTATATCCC 
CA1X3TTTACG ATCGATTGAT ATTGTAACCG ATG6TCCCGA AGATATA6GG 

21751 AGAGAGCTAC AAGGACCGCA TGTACTCCTT CTTTAGAAAC TTCCAGCCCA 
TCTCTCGATG TTCCTGGCGT ACATGAGGAA GAAATCTTTG AAGGTCGGGT 
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21801 TGAGCCGTCA (^KGTGGAT GATACTAAAT ACAAGGACTO«-C(3WiCA^gK3 
ACTCCGCAGT d^CACCTA CTATGATTTA T6TTCCTGAT GGTTGa^Pc 

21851 GGCATCCTAC ACCAACACAA CAACTCTGGA TTTGTTGGCT ACCTTGCCCC 
CCGTAGGATG TGGTTGTGTT GTTGAGACCT AAACAACCGA TGGAACGGGG 

- 21901 CACCATGCGC GAA6GACAGG CCTACCCTGC TAACTTCCCC TATCCGCTTA 
GTGGTACGCG CTTCCTGTCC GGATGGGACG ATTGAACGGG ATAGGCGAAT 

21951 TAGGCAAGAC CCCAGTTCAC AGCArTACCC AGAAAAAGTT TCTTTGCGAT 
ATCCGTTCTG GCGTCAACTG TCGTAATGGG TCTTTTTCAA AGAAACGCTA 

22001 CGCACCCTTT GGCGCATCCC ATTCTCCAGT AACTTTATGT CCATGGGCGC 

gcgtgggaaa ccgcgtaggg taagaggtca ttgaaataca ggtacccgcg 
22051 actcacagac ctgggccaaa accttctcta cgccaactcc gcccacgcgc 

TCAGTGTCTG GACCCG6TTT TGGAAGAGAT GCGGTTOAGG CGGGTGCGCG 

22101 TAGACATGAC TTTTGAG6TG GATCCCATGG ACGAGCCCAC CCTTCTTTAT 
ATCTGTACTG AAAACTCCAC CTAGGGTACC TGCTCGGGTG GGIAAGAAATA 

22151 GTTTTGTTTG AAGTCTTTGA C6TGGTCCGT GTGCACCAGC CGCACCGCGG 
CAAAACAAAC TTCAGAAACT GCACCAGGCA CACGTGGTCG GCGTGGCGCC 

22201 CGTCATCGAA ACCGTGTACC TGCGCACGCC CTTCTCGGCC GGCAACGCCA 
GCAGTAGCTT TGGCACATGG ACGCCTGCGG GAAGAGCCGG CCGTTGCGGT 

22251 CAACATAAAG AAGCAAGCAA CATCAACAAC AGCTGCCGCC ATGGGCTCCA 
GTTGTATTTC TTCGTTCGTT GTAGTTGTTG TCGACGGCGG TACCCGAGGT 

" 22301 GTGAGCAGGA ACTGAAAGCC ATTGTCAAAG ATCTTGGTTG TGGGCCATAT 
CACTCGTCCT TGACTTTCGG TAACAGTTXC TAGAACCAAC ACCCGGTATA 

22351 TTTTTGGGCA CCTATGACAA GCGCTTTCCA GGCTTTGTTT CTCCACACAA 
AAAAACCCGT G6ATACTGTT CGCGAAAGGT CCGAAACAAA GA66TGTGTT 

22401 GCTCGCCTGC GCCATAGTCA ATACGGCCX5G TCGCGAGACT GGGGGCGTAC 
CGA6CGGACG CGGTATCAGT TATGCCGGCC AGCGCTCTGA CCCCCGCATG 

22451 ACTGGATGGC CTTTGCCTGG AACCCGCACT CAAAAACATG CTACCTCTTT 
TGACCTACCG GAAACGGACC TTGGGCGTGA GTTTTTGTAC GATGGAGAAA 

22501 GAGCCCTTTG GCTTTTCTGA CCAGCGACTC AAGCAGGTTT ACCAGTTTGA 
CTCGGGAAAC CGAAAAGACT GGTCGCTGAG TTCGTCCAAA TGGTCAAACT 

22551 GTACGAGTCA CTCCTGCQCC GTAGCGCCAT TGCTTCTTCC CCCGACCGCT 
CATGCTCAGT GAGGACGCGG CATCGCGGTA ACGAAGAAGG GGGCTGGCGA 

22601 GTATAACCCT 6GAAAAGTCC ACCCAAAGCG TACAGGGGCC CAACTCGGCC 
CATATTGC6A CCTTTTCAG6 TGGGTTTCCC ATGTCCCCG6 CTTGACCCGG 

22651- GCCTGTGGAC TATTCTQCTG CATGTTTCTC CACGCCTTTG CCAACTGGCC 
CGiSACACCTG AOSIAGACGAC GTACAAA6AG GTGCGGftAAC GGTTGACC60 

22701 CCAAACTOCC ATGGATCACA ACCCCAC'CAT GAACCTTATT ACCGGGGTAC 
GGrrTGAGGG TACCTAGTGT TGGGGTGGTA CTTGGAATAA TGGCCCCATG 
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22751 CCAACTCCAT GCTCAACAGT CCX;CAGG?TAC }^CChC(^S^^Q^^q^^C : 
GGTTGAG6TA ,q^^TTCn*CA GGGGTCC21T6 TCG6GTGGGA CGCAGC||pG 

22B01 CAGGAACAGC TCTACAGCTT CCTGGAGCGC CACTCGCCCT ACTTCCGCAG 
GTCCTTGTCG AGAT6TCGAA GGACCTCGC6 GTGAGCGGGA TGAAGGCGTC 

22851 CCACAGTGCG CAGATTAGGA GCGCCACTTC TTTTTGTCAC TTGAAAAACA 
GGTG7CACGC GTCTAATCCT CGCGGTGAA6 AAAAACAGT6 AACTTTTTGT 

22901 TCTAJUIAATA ATGTACTAGA CACACTTTCA ATAAAGGCAA ATGCTTTTAT 
ACATTTTTAT TACATGATCT CTGTGAAAGT TATTTCCGTT TACGAAAATA 



22951 TTGTACACTC TCGGGTGATO ATTTACCCCC ACCCTTGCCG TCTGCGCCGT 
AACATGTGAG AGCCCACTAA TAAATGGGGG TGGGAACGGC AGACGCGGCA 

23001 TTAAAAATCA AAGGGGTTCT 6CCGCQCATC GCTATGCGCC ACTGGCAGGG 
AATTTTTAGT TOCCCCAAGA COGCGCGTAG C6ATACGCGG TGACCGTCCC 

23051 ACACGTTGCG ATACTGGTGT TTAGTGCTCC ACTTAAACTC AGGCACAACC 
TGTGCAACGC TATGACCACA AATCACGAGG TGAATTTGAG TCCGTGTTGG 

23101 ATCCGCGGCA GCTCGGTGAA GTTTTCACTC CACAGGCTGC 6CACCATCAC 
TAGGCGCCGT CGAGCCACTT CAAAAGTCAG GTGTCCGACG" CCTGGTAGTG 

23151 CAACGCGTTT AGCAGGTCGG GCGCCGATAT CTTGAAGTCG CAGTTGGGGC 

GTTGCGCAAA TCGTCCAGCC CGCGGCTATA gaacttcagc gtcaaccccg 

23201 ctccgccctg cgcgcgcgag ttgcgataca cagggttgca gcactggaac 
gaggcgggac gcgcgcgctc aacgctatgt gtcccaacgt cgtgaccttg 

23251 actatcagcg ccgggtggtg cacgctggcc agcacgctct tgtcggagat 
tgatagtcgc ggcccaccac gtgcgaccgg tcgtgcgaga acagcctcta 

23301 cagatccgcg tccaggtcct ccgcgttgct cagggcgaac ggagtcaact 
gtctaggcgc aggtccagga ggcgcaacga gtcccgcttg cctcagttga 

23351 ttggtagctg ccttcccaaa aagggcgcgt gcccaggctt tgagttgcac 

AACCATCGAC GGAAGGGTTT TTCCCGCGCA CGGGTCCGAA ACTCAACGTG 

23401 TCGCACCGTA 6TGGCATCAA AAGGTGACCG TGCCCGGTCT GGGCGTTAGG 
AGCGTGGCAT CACCGTAGTT TTCCACTGGC ACGGGCCAGA CCCGCAATCC 

23451 ATACAGCGCC TGCATAAAAG CCTTCATCTG CTTAAAAGCC ACCTGAGCCT 
TATGTCGCG6 ACGTATTTTC GGAAC7AGAC GAATTTTCGG TQGACTCGGA 

23501 TTGCGCCTTC A6AGAAGAAC ATGCCGCAAG ACTTGCCGGA AAACTGATTG 
AACGCGGAAG TCT CTT CTT G TACGGCGTTC TGAACGGCCT TTTGACTAAC 

23551 GCCOGACAGG CCQCGTOGTG CACGCA6CAC CTTCCGTCGG TCTTGGAGAT 
CGGCCTGTCC GGCGCAQCAC GTGCGTCGTG GAACOCAGCC ACAACCTCTA 

23601 CTGCACCACA TTTCGGCCCC ACCG6TTCTT CACGATCTTG GCCCTGCTAG 
6ACGTGGTGT AAAGCCGGGG TGGCCAAGAA G7GC7AGAAC CGGAACGATC 

23651 ACTGCTCCTT CAGCGCGCGC TGCX:CGTTTr CGCTCGTCAC ATCCATTTCA 
TGACGAGGAA GTCGCGCGCQ ACGGGCAAAA GCGAGCAGTG TAG6TAAAGT 
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23701 ATCACGTGCT (SRatTTAT CATAATtSCTT CCGTGTAGAC ACTTAaIRc 

TAGTGCACGA OGAATAAATA GTATTACGAA GCCACATCTG TGAATTCGAG 

23751 GCCTTCGATC TCAGCGCAGC GGTGCAGCCA CAACGCGCAG CCCGTGGGCT 

CGGAAGCTAG AGTCGCGTCG CCACGTCXSGT GTTGCGCGTC GGGCACCCGA 

238 01 CGTGATGCTT GTAGGTCACC TCTGCAAACG ACTGCAGGTA CGCCTGCAGG 

GCACTACGAA CATCCAGTGG A6ACGCTTGC TGACGTCCAT GCGGACGTCC 

23B51 AATCGCCCCA TCATCGTCAC AAAGGTCTCG TTGCTGGTGA AGGTCAGCTG 

TTAGCGGGGT AGTAGCAGTG TTTCCAGAAC AACGACCACT TCCAGTCGAC 



23901 CAACCCGCGG TGCTCCTCGT TCAGCCAGGT CTTGCATACG GCCGCCAGAG 
GTTOOGCGCC ACGAGGAGCA AGTCGCTCCA GAACGTATGC CGGCGGTCTC 

23951 CTTCCACTTG CTCAGGCAGT AGTTTGAAGT TCGCCTTTAG ATCGTTATCC 
GAAGGTGAAC CAGTCCGTCA TCAAACTTCA AGCGGAAATC TAGCAATAGG 

24001 ACGTGGTACT TGTCCATCAG CGCGCGCGCA GCCTCCATGC CCTTCTCCCA 
TOCACCATGA ACA6GTAGTC GCGCGCGCGT CGGAGGTACG GGAAGAGGGT 

24051 CGCAGACACG ATCGGCACAC TCAGCGGGTT CATCACCGTA ATTTCACTTT 
GCGTCTCTGC lAGCCGTGTG AGTCGCCCAA GTAGTGGCAT TAAAGTGAAA 

24101 CC6CTTCGCT GGGCTCTTCC TCTrCCTCTT GCGTCCGCAT ACCACGCGCC 
GGC6AAGCGA CCCGAGAAGG AGAAGGAGAA CGCAGGCGTA TGGTGCGCGG 

24151 ACTGGGTCGT CTTCATTCAG CCGCCGCACT GTGCGCTTAC CTCCTTTGCC 
TGACCCAGCA GAAGTAAGTC GGCGGCGTGA CACGCGAATG GAGGAAACGG 

24201 ATGCTTGATT AGCACCGGTG GGTTGCTGAA ACCCACCATT TGTAGCGCCA 
TACGAACTAA TCGTGGCCAC CCAACGACTT TGGGTG6TAA ACATCGCGGT 

24251 CATCTTCTCT TTCTTCCTCG CTGTCCACGA TTACCTCTGG TGATGGCGGG 
GTAGAAGA6A AAa^AGGAGC GACAGGTGCT AATGGAGACC ACTACCGCCC 

24301 CCCTCGGGCT TGGGAGAAGG GCGCTXTTT 7TCTTCTTGG GCGCAATGGC 
GCGAGCCCGA ACCCTCTTCC CGCGAAGAAA AAGAAGAACC CGCGTTACCG 

24351 CA7ATCCGCC GCCGAG6TCG ATGGCCGCGG GCTGGGTGTG CGCGGCACCA 
GTTTAGGCGG CGGCTCCAGC TACCGGCGCC CGACCCACAC GCGCCGTGGT 

24401 GCGCGTCTTG ?GATGAGTCT TCCTCGTCCT CGGACTCGAT ACGCCGCCTC 
CGCGCAGAAC ACTACTCA6A AGGA6CAGGA GCCTGAGCTA TGCGGCGGAG 

24451 ATCCGCTTTT TTGGGGGCGC CCGGGGAGGC GGCGGCGACG GGGACGGGGA 
TAGGCGAAAA AACCCCCGCG 6GCCCCTCCG CCGCCGCTGC CCCTGCCCCT 

24501 CGACACGTCC TCCATGGTTG GQGGAC6TCG CGCCGCACCG CGTCCGCGCT 
GCTGTGCAGG AGGTACCAAC CCCCT6CA6C GCGGCGTGGC GCAGGCGCGA 

24551 CGGGGGTGGT TTCGCGCTGC TCCTCTTCCG GACTGGCCAT TTCCTTCTCC 
GCCCCCACCA AACCGCGACG AGGA6AAGGG CTGACCGGTA AAGGAAGAGG 

24601 TATAGGCAGA AAAAGATCAT GGAGTCAGTC GAGAAGAAG6 ACAGCCTAAC 
ATATCCGTCT TTTTCTAGTA CCTCAGTCAG CTCTTCTTCC TGTCGGATTG 
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2465X CGCCCCCTCT 




GCCA CCACCGCCTC CACCGATCCC GL'CAA* 




6CG6GGGAGA CKAAGCGGT GGT6GCGG;iG GTGGC7ACGG CGGTTGRcQ 

24701 CTACCACCTT CCCCGTCGAG GCACCCCCGC TTGAGGAGGA GGAAGTGATT 
6ATGGTGGAA GG6GCAGCTC C6TGGGGGC6 AACTCCTCCT CCTTCACTAA 

24751 ATCGAGCAGG ACCCAGGTTT TGTAAGCGAA GACGACGAGG ACCGCTCAGT 
TAGCTCGTCC TGGGTCCAAA ACATTCGCTT CTGCTGCTCC TGGCGAGTCA 

24801 ACCAACAGAG GA7AAAAA6C AAGACCAGGA CAAC6CAGAG GCAAACGAGG 
TGGTTGTCTC CTATTTTTCG TTCTGGTCCT GTTGCGTCTC CGTTTGCTCC 

24851 AACAAGTCGG GCGGGGGGAC 6AAAGGCATG GCGACTACCT AGATGTGGGA 
TTGTTCAGCC CGCCCCCCTG CTTTCC6TAC C6CTGATGGA. TCTACACCCT 

24901 GACGACGTGC TGTTGAAGCA TCTGCAGCGC CAGTGCGCCA TTATCTGCGA 
CTGCTGCACG ACAACTTCGT AGACGTCGCG. GTCACGCGGT AATAGACGCT 

24951 CGCGTTGCAA GAGCGCAGCG ATGTGCCCCT CGCCATAGCG GATGTCAGCC 
GCGCAACGTT CTCGCGTC6C TACACG6GGA GC6GTATCGC CTACAGTCGG 

25001 TTGCCTACGA ACGCCACCTA TTCTCACCGC GCGTACCCCC CAAACGCCAA 
AACGGATGCT TGCGGTGGAT AAGAGTG6CG CGCATGGGGG GTTTGCGGTT 

25051 GAAAACGGCA CATGCGAGCC CAACCCGCGC CTCAACTTCT ACCCCGTATT 
CTTTTGCCGT GTACGCTCGG GTTGGGCGCG GAGTTGAAGA TGGGGCATAA 

25101 TGCCGTGCCA GAGGTGCTTG CCACOTATCA CATCTTTTTC CAAAACTGCA 
ACGGCACGGT CTCCACGAAC GGTGGATAGT GTAGAAAAAG GTTTTGACGT 

25151 AGATACCCCT ATCCTGCCGT GCGAACCGCA GCC6AGCGGA CAAGCAGCTG 
TCTATGGGGA TAGGACGGCA CGGTTGGCGT CGGCTCGCCT GTTCGTCGAC 

25201 GCCTTGCGGC AGGGCGCTGT CATACCTGAT ATCGCCTCGC TCAACGAAGT 
CGGAACGCCG TCCCGCGACA GTATGGACTA TAGC6GAGCG AGTTGCTTCA 

25251 GCCAAAAATC TTTGAGGGTC TTGGACGCGA CGAGAAGCGC GCGGCAAACG 
CGGTTTTTAG AAACTCCCAG AACCTGCGCT GCTCTTCGCG CGCCGTTTGC 

25301 CTCTGCAACA GGAAAACAGC GAAAATGAAA GTCACTCTGG AGTGTTGGTG 
GAGACGTTGT CCTTTTGTCG CTTTTACTTT CAGTGAGACC TCACAACCAC 

25351 GAACTCGAGG GTGACAACGC GCGCCTAGCC GTACTAAAAC GCAGCATCGA 
CTTGAGCTCC CACTGTTGCG CGCGGATCGG CATGATTTTG CGTCGTAGCT 

25401 GGTCACCCAC TTTGCCTACC CGGCACTTAA CCTACCCCCC AAGGTCATGA 
CCAGTGGGTG AAACGGATGG GCCGT6AATT GGATGGGGG6 TTCCAGTACT 

25451 GCACAGTCAT GAGTGAGCTG ATCGTGCGCC GTGCGCAGCC CCTGGAGAGG 
CGT6TCAGTA CTCACTCGAC TAGCACGCGG CACGCGTCGG OGACCTCTCC 

25501 GATGCAAATT TGCAAGAACA AACAGAGGAG GGCCTACCCG CAGTTGGCGA 
CTACGTTTAA ACGTTCT T GT TTCTCTCCTC CCGGATGG6C GTCAACCGCT 

m 

25551 CCAGCAGCTA GCBCGCTQGC TTCAAACGCG CGAGCCTGCC GACTTGGAGG 
OCTCGTCGAT CGCGC6ACCG AAGTTTGCGC GCTCGGACGG CT6AACCTCC 
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25601 ftGCGACGO^ ^^ATGATG GCC6CAGTGC TCGTTACCGT OGAGCII^G 
TCGCTGCGTT TOITTACTAC CGGCGTCACG ACCAATGGCA CCTCGAACTC 

25551 TGCATGCAGC GGTTCTTTOC TGACCCGGAG ATGCACCCCA AGCTAGAOGA 
ACGTACGTCG CCAAGAAACG" ACT6QGCCTC TACGTCGCGT TCGATCTCCT 

25701 AACATTGCAC TACACCTTTC GACAGGGCTA CGTAC6CCAG GCCTGCAAGA 
TTGTAACGTG ATGTGGAAAG CTGTCCCGAT GCATGCCCTC CGGACCTTCT 

25751 TCTCCAACGT GGAGCTCTGC AACCTGGTCT CCTACCTTGC AATTTTGCAC 
AGAGGTTGCA CCTCGAGACG TTGGACCAGA GGAT6GAACC TTAAAACGTG 



25801 GAAAACCGCC TTGGGCAAAA CGTSCTTCAT TCCACGCTCA AGGGCGAGGC 
CTTTTGGCGG AACCCGTTTT GCACGAAGTA AGGTGCGAGT TCCCGCTCCG 

25851 GCGCCGCGAC TACGTCCGCG ACTGCGTTTA CXTATTTCTA TQCTACACCT 
CGCGGCGCTG ATGCAGGCGC TGACGCAAAT CAATAAAGAT ACGATCTGGA 

25901 GGCAGACGGC CATGGGCGTT TGGCAGCAGT GCTTGGAGGA GTGCAACCTC 
CCGTCTGCCG GTACCCGCAA ACCGTCGTCA CGAACCTCCT CACGTTGGAG 

25951 AAGGAGCTGC AGAAACTGCT AAAGCAAAAC TTGAAGGACC TATGGACGGC 
TTCCTCGACG TCTTTGACGA TTTCGTTTTG AACTTCCTGG ATACCTGCCG 

26001 CTTCAACGAG C6CTCCGTGG CCGCGCACCT GGCGGACATC ATTTTCCCCG 
GAAGTTGCTC GCGAGGCACC GGCGCGTGGA CCGCCTGTAG TAAAAGGGGC 

26051 AACGCCTGCT TAAAACCCTG CAACAGGGTC TGCXAGACTT CACCAGTCAA 
TTGCGGACGA ATTTTGGGAC GTTGTCCCAG ACGGTCTGAA GTGGTCAGTT 

26101 AGCATGTTGC AGAACTTTAG GAACTTTATC CTAGAGCGCT CAGGAATCTT 
TCCTACAACG TCTTGAAATC CTTGAAATAG GATCTCGCGA GTCCTTAGAA 

26151 GCCCGCCACC TGCTGTGCAC TTCCTAGCGA CTTTGTGCCC ATTAAGTACC 
CGGGCGGTGG ACGACACGTG AAGGATCGCT GAAACACGGG TAATTCATGG 

26201 GCGAATGCCC TCCGCCGCTT TGGGGCCACT GCTACCTTCT GCAGCTAGCC 
CGCTTACGGG AGGCGGCGAA ACCCCGGTGA CGATGGAAGA CGTCGATCGG 

26251 AACTACCTTG CCTACCACTC TGACATAATG GAAGACGTGA GCGGTGACGG 
TTGATGGAAC GGATGGTGAG ACTGTArTAC CTTCTGCACT CGCCACTGCC 

26301 TCTACTGGAG TGTCACTGTC GCTGCAACCT ATGCACCCCG CACCGCTCCC 
AGATGACCTC ACACTGACAG CGACGTTGGA TACGTGGGGC GTGGCGAGGG 

26351 TGGTTTGCAA TTCGCAGCTG CTTAACGAAA GTCAAATTAT CGGTACCTTT 
ACCAAACGTT AAGCGTCGAC GAATTOCTTT CAGTTTAATA GCCAT6GAAA 

26401 GAGCTGCAGG GTCCCTCGCC TGACGAAAAG TCCGCGGCTC CGGGGTTGAA 
CTC6ACGTCC CAGGGAOCGG ACTGCrTTTC AGGCGCCGAG GCCCCAACTT 

26451 ACTCACTCCG GGGCTGTGGA CGTCGGCTTA CCTTCGCAAA TTTGTACXTG 
TGAGTGAGGC CCCGACACCT GCAGCCGAAT GGAAGCGTTT AAACATGGAC 

26501 AGGACTACCA CGCCCACGAG ATTAG3TTCT ACGAAGACCA ATCCCGCCCG 
TCCTGATGGT GCCGCTGCTC TAATCCAAGA T6CTTCTGGT TAGGGCGGGC 
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26551 CCTAATGCGG ^JIRtaCCGC CTCCGTCATT ACCCAGGGCC ACATTC^IPSG 
GGATTACGCC TCGAATOGCG GACGCAGTAA TGGGTCCCGG TGTAAGAACC 

26601 CCAATTGCAA GCCATCAACA AAGCCCGCCA AGAGTTTCTG CTACGAAAGG 
GGTTAACGTT CGGTAGTTGT TTCGGQCGGT TCTCAAAGAC GArGCTTTCC 

26651 GACGGGGGGT rTACTTCGAC CCCCAGTCCG GCGAGGAGCT CAACCGAATC 
CTGCCCCCCA AATGAACCTG GGGGTCAGGC CGCTCCTCGA GTTGGGTTAG 

26701 CCCCCGCCGC CGCAGCCCTA TCAGCAGCAG CCGCGGGCCC TTGOTTCCCA 
GGGGGCGOCG GCGTCGGGAT AGTCGTCGTC GGCGCCCGfGG AACGAA6GGT 



26751 GGATGGCACX: CAAAAAGAAG CTGCAGCTGC C6CCGCCACC CACX5GACGAG 
CCTACCGTGG GTTTTTCTTC GACGTC6ACG GCGGCGGTGG GTGCCTGCTC 

26B01 GAGGAATACT GGGACAGTCA GGCAGAGGAG GTTTTGGACG AGGAGGAGGA 
CTCCTTATGA CCCTGTCAGT CCGTCTCCTC CAAAACCTGC TCCTCCTCCT 

26B51 GGACATGATG GAAGACTGGG AGAGCCTAGA CGAGGAAGCT TCCGAGGTCG 
CCTGTACTAC CTTCTGACCC TCTCGGATCT GCTCCTTCGA AGGCTCCAGC 

26901 AAGAGGTGTC AGACGAAACA CCGTCACCCT CGGTCGCATT CCCCTCGCCG 
TTCTCCACAG TCTGCTTTGT GGCAGTGGGA GCCAGCGTAA GGGGAGCGGC 

26951 GCGCCCCAGA AATCGGCAAC CGGTTCCAGC ATGGCTACAA CCTCCGCTCC 
CGCGGGGTCT TTAGCCGTT6 GCCAAGGTCG TACCGATGTT GGAGGCGAGG 

27001 TCAGGCGCCG CCGGCACTGC CCGTTCGCCG ACCCAACCGT AGATGGGACA 
AGTCCGCGGC G6CCGTGACG GGCAAGCGGC TGGGTTGGCA TCTACCCTGT 

27051 CCACTGGAAC CAG3GCCG6T AAGTCCAAGC AGCCGCCGCC GTTAGCCCAA 
GGTGACCTTG GTCCCGGCCA TTCAGGTTCG TCGGCGGCGG CAATCGGGTT 

27101 GAGCAACAAC AGCGCCAAGG CTACCGCTCA TGGCGCGGGC ACAAGAACGC 
CTCGTTGTTG TCGCG6TTCC GATGGCGAGT ACCGCGCCCG TGTTCTTGCG 

27151 CATAGTTGCT TGCTTGCAAG ACTGTGGGGG CAACATCTCC TTCGCCCGCC 
GTATCAACGA ACGAACGTTC TGACACCCCC GTTGTAGAGG AAGCGGGCGG 

27201 GCTTTCTTCT CTACCATCAC GGCGTGGCCT TCCCCCGTAA CATCCTGCAT 
CGAAAGAAGA GATGGTAGTG CCGCACCGGA AGGGGGCATT GTAGGACGTA 

27251 TACTACCGTC ATCTCTACAG CCCATACTGC ACCGGCGGCA GCGGCAGCAA 
ATGATGGCAG TAGA6ATGTC GGGTATGACG TGGCCGCCGT CGCCGTCGTT 

27301 CAGCAGCGGC CACACAGAAG CAAAGGCGAC CGGATAGCAA GACTCTGACA 
CTCGTCGCCG 6TGTGTCTTC GTTTCCGCTG GCCTATCGTT CTGAGACTGT 

27351 AAGCCCAAGA AATCCACAGC GGCGGCAGCA GCAGGAGGAG GAGCGCTGCG 
TTCGGGTTCT TTAGGTGTCG CCGCCGTCGT CGTCCTCCTC CTCGCGACGC 

27401 TCTGGCGCCX: AACGAACCCG TATCGACCCG CGAGCTTAGA AACAGGATTT 
AGACCGCCGG TTGCTTGGGC ATAGCTGGGC GCTCGAATCT TTGTCCTAAA 

27451 TTCCCACTCT GTATGCTATA TTTCAACAGA GCAGGGGCCA AGAACAAGAG 
AAGOGTGAGA CATACGATAT AAAGTTGTCT CGTCCCCGGT TCTTGTTCTC 
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27501 CTGAAAATAA ftM^ChSGTC TCTGCGATCC CTCACCCCCA GCTGCq»A 
GACTTTTATT TmTGTCCAG AGACGCTAGG GAGTGGGCGT CGACGGXaT 

• 27551 TCACAAAAGC GAAGATCAOC TTCG6CGCAC GCTGGAAGAC GCGGAGGCTC 
AGTCTTTTCG CTTCTAGTCG AAGCCGCGTG CGACCTTCT6 CGCCTCCGAG 

27601 TCTTCAGTAA ATACTGCGCG" CTGACTCTTA AGGACTAGTT TCGCGCCCTT 
AGAAGTCATT TATGACGCGC GACTGAGAAT TCCTGATCAA AGCGCGGGAA 

27551 TCTCAAATTT AAGCGCGA&A ACTACGTCAT CTCCAGCGGC CACACCCGGC 
AGAGTTTAAA TTCGCGCTTT TGATGCAGTA GAGGTCGCC6 GTGTCGGCCG 



27701 6CCAGCACCT GTTGTCAGCG CCATTATGAG CAAGGAAATT CCCACGCCCT 
CGGTCGTGGA CAACAGTCGC OGTAAIACTC GTTCCTTTAA GG5TGCGGGA 

27751 ACATGTGGAG TTACCAGCCA CAAATGGGAC TTGCGGCTGG ACCTGCCCAA 
TGTACACerC AATGGTCG6T GTTTACCCTG AACGCCGACC TCGACGGGTT 

27801 GACTACTCAA CCCGAATAAA CTACATGAGC GCGGGACCCC ACATGATATC 
CTGATGAGTT GGGCTTATTT GATGTACTCG CGCCCTGGGG TGTACTATAG 

27851 CCGGGTCAAC GGAATACGCG CCCACTGAAA CCGAATTCTC CrTGGAACAGG 
GGCCCAGTTG CCTTATGCGC GGGTGGCTTT GGCTTAAGAG GACCTTGTCC 

27901 CGGCTATTAC CACCACACCT CGTAATAACC TTAATCCCCG TAGTTGGCCC 
GCCGATAATG GTGGTGT6GA GCATTATTG6 AArTAGGGGC ATCAACCGGG 

27951 GCTGCCCTGG TGTACCAGGA AAGTCCCGCT CCCACCACT6 TQGTACTTCC 
CGACGGGACC ACATGGTCCT TTCAGGGCGA 6GGTGGTGAC ACCATGAAGG 

28001 CAGAGACGCC CAGGCCGAAG TTCAGATCAC TAACTCAGGG GCGCAGCTTG 
GTCTCTGCGG GTCCGGCTTC AA6TCTACTG ATTGAGTCCC CGCGTCGAAC 

28051 CGGGCGGCTT TCGTCACAGG GTGCGGTCGC CCGGGCAOGG TATAACTCAC 
GCCCGCCGAA AGCAGTGTCC CACGCCAGCG GGCCC6TCCC ATATTGAGTG 

28101 CTGACAATCA GAGGGCGAGG TATTCAGCTC AACGACGAGT CGGTGAGCTC 
GACTGTTAGT CTCXCGCTCC ATAAGTCGAG TTGCTGCTCA GCCACTCGAG 

28151 CTCGCTTGGT CTCCG7CCGG ACGGGACATT TCAGATCGGC GGCGCCGGCC 
GAGCGAACCA GAGGCAGGCC TGCCCTGTAA AGTCTAGCCG CCGCGGCCGG 

28201 GCTCTTCATT CACGCCTCGT CAGGCAATCC TAACTCTGCA GACCTCGTCC 
CGAGAAGTAA GTGCGGAGCA GTCCGTTAGG ATTGAGACGT CTGGAGCAGG 

28251 TCTGAGCCGC GCTCTGGAQG CATTGGAACT CTGCAATTTA TTGAGGAQTT 
AGACTCGGCG CGAGACCTCC GTAACCTTGA GACGTTAAAT AACTCCTCAA 

28301 TGTGCCATCG GTCTACTTTA ACCCCTTCTC CGGACCTCCC GGCCACTATC 
ACACGGTAQC CAGATGAAAT TGGGGAAGAG CCCTGGAGGG CCGGTGATAG 

28351 CGGATCAATT TATTCCTAAC TTTGACGCGG TAAAGGACTC GGCGGACGGC - 
GCCTAGTTAA ATAAGGATT6 AAACTGCGCC ATTTCCTGAG CC6CCTGCC6 

28401 TACGACTGAA TGTTAAGTGG AGAGGCAGAG CAACTGCGCC TGAAACACCT 
ATGCTGACTT ACAATTCACC TCTCCGTCTC GTTGACGCGG ACTTTGTGGA 
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28451 GGTCCACTGT dlftOCCACA AGTGCTTTGC CCGCGACTCC GGTGAblPl i 
CCAGGTGACA 6C6GCGGT6T TCACQAAACG GGCGCT6AGG CCACTCAAAA 

28501 GCTACTTTGA ATTGCCCGAG GATCATATCG AGGGCCCGGC GCACGGCGTC 
CGATGAAACT TAACGGGCTC CTAGTATAGC TCCCGGGCCG CGTGCXIGCAG 

28551 CGGCTTACCG CCCAGGGAGA GCTTGCCCGT AGCCTGATTC GGGAGTTTAC 
GCCGAATGGC 6GGTCCCTCT CGAACGGGCA TCGGAC7AAG CCCTCAAATG 

26601 CCAGCGCCCC CTGCT AGTTG AGCGGG ACAG GGGACCCTGT GT7CTCACTG 
0GTCGC6GGG GACGATCAAC TCGCCCTGTC CCCTG6GACA CAAGAGTGAC 



2B651 T6ATTTGCAA CT6TCCTAAC CCTGGATTAC ATCAAGATCT TTGTTGCCAT 
ACTAAACGTT GACAG6ATTG G6ACCTAATO TAGTTCTA6A AACAACGGTA 

28701 CTCTGTGCT6 AGTATAATAA ATACAGAAAT TAAAATATAC TGGGGCTCCT 
GAGACACGAC TCATATTATT TATGTCTTTA ATTTTATAT6 ACCCCGAGGA 

28751 ATCGCCATCC TGrAAACGCC ACCGTCTTCA CCCGCCCAAG CAAACCAAGG 
TAGCGGTAGG ACATTTGCGG TGGCAGAAGT OGGCGGGTTC' 6TTTGGTTCC 

2B801 CGAACCTTAC CT6GTACTTT TAACATCTCT CCCTCTGTGA TTTACAACAG 
GCTTGGAATG GACCATGAAA ATTGTAGAGA GGGAGACACT AAATGTTGTC 

28651 TTTCAACCCA GACGGAGTGA GTCTACGAGA GAACCTCTCC GAGCTCAGCT 
AAAGTTGGGT CTGCCTCACT CAGATGCTCT CTTGGAGAGG CTCGAGTCGA 

28S01 ACTCCATCAG AAAAAACACC ACCCTCCTTA CCTGCCGGGA ACGTACGAGT 
TGAGGTAGTC TTTTTTGTGG TGGGAGGAAT GGACGGCCCT TGCATGCTCA 

28951 GCGTCACCGG CCGCTGCACC ACACCTACCG CCTGACCGTA AACCAGACTT 
CGCAGTG6CC GGCGACGTG6 TGTGGATGGC GGACTGGCAT T7GGTCTGAA 

29001 TTTCCGGACA GACCTCAATA ACTCTGTTTA CCAGAACAGG AGGTGAGCTT 
AAAGGCCTGT CTGGAGTTAT TGAGACAAAT GGTCTTGTCC TCCACTCGAA 

29051 AGAAAACCCT CAGGGTATTA GGCCAAAGGC GCAGCTACTG TGGGGTTTAT 
TCTTTTGGGA ATCCCATAAT CCGGTTTCCG CGTCGATGAC ACCCCAAATA 

29101 GAACAATTCA AGCAACTCTA CGGGCTATTC TAATTCAGGT TTCTCTAGAA 
CTTGTTAAGT TCGTTGAGAT GCCCGATAA6 ATTAAGTCCA AA6AGATCTT 

29151 TCGG6GTTQ6 GGTTATTCTC TGTCTTGTGA TTCTCTTTAT TCTTATACTA 
AGCCCCAACC CCAATAAGAG ACAGAACACT AAGAGAAATA AGAATATGAT 

29201 ACOCTTCTCT GCCTAAGOCT CGCCGCCTGC TCTCTGCACA TTTGCATTTA 
TGCGAAGAGA CGGATTCCGA GCGGCGGAC6 ACACACGTGT AAACGTAAAT 

29251 TTGTCAGCTT TTTAAACGCT 66GGTCGCCA OCCAAGATGA TTAGGTACAT 
AACAGTCGAA AAATTTGCGA CCCCA6C6GT G6GTTCTACT AATCCATGTA 

29301 AATCCTAGGT TXACTCACCC TTGCGTCAGC CCACGGTACC ACCCAAAAGG 
TTAOGATCCA AATGAGT666 AACGCAGTCG GGTGCCATOG TGGGTTTTCC 

29351 TGGATTTTAA 6GA6CCAGCC TGTAATGTTA CATTC6CAGC TGAAGCTAAt 
ACCTAAAATT CCTCGGTCGG ACATTACAAT GTAAOCGTCG ACTTCGATTA 
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29401 GAGTGCACCA CMTTATAAA ATGCACCACA GAACATttAAA AWLTJVl^^T 
CTCACGTGGT GAGAATATTT TACGTGGTGT CTTGTACTTT TCGACgStA 

29451 TCGCCACAAA AACAAAATTG GCAAGTATGC TGTTTATGCT ATTTGGCAGC 
AGCGGTGTTT TTGTTTTAAC CGTTCATACG ACAAATACGA TAAACC6TCG 

29501 CAGGTGACAC TACAGAGTAT AATGTTACAG TTTTCCAG5G TAAAAGTCAT 
GTCCACTGTG ATGTCTCATA TTACAATGTC AAAAGGTCCC ATTTTCAGTA 

29551 AAAACTTTTA TGTATACTTT TCCATTTTAT GAAATGTGCG ACATTACCAT 
TTTTGAAAAT ACATATGAAA AGGTAAAATA CTTTACACGC TGTAATGGTA 



29601 GTACATGAGC AAACACTATA AGTTGTGGCC CCCACAAAAT TGTGTGGAAA 
CAT6TACTCG TTTGTCATAT TCAACACCGG GGGT6TTTTA ACACACCTTT 

29651 ACACTGGCAC TTTCTGCTGC ACTGCTATGC TAATTACAGT GCTCGCTTTG 
TGTGACCGTG AAAGAC6ACG TGACGATACG ATTAATGTCA CGAGCGAAAC 

29701 GTCTGTACCC TACTCTATAT TAAATACAAA AGCAGACGCA GCTTTATTGA 
CAGACATGGG ATGAGATATA ATTTATGTTT TCGTCTGCGT CGAAATAACT 

29751 GGAAAAGAAA ATGCCTTAAT TTACTAAGTT ACAAAGCTAA TGTCACCACT 
CCTTTTCTTT TACGGAATTA AATGATTCAA TGTTTCGATT ACAGTGGTGA 

29801 AACTGCTTTA CTCCCTGCTT GCAAAACAAA TTCAAAAAGT TAGCATTATA 
! TTGACGAAAT GAGCGACGAA CGTTTTGTTT AAGTTTrTCA ATCGTAATAT 

29851 ATTAGAATAG GATTTAAACC CCCCGGTCAT TTCCTGCTCA ATACCATTCC 
TAATCTTATC CTAAATTTGG GGGGCCAGTA AAGGACGAGT TATGCTAAGG 

29901 CCTGA.ACAAT TGACTCTATG TGGGATATGC TCCAGCGCTA CAACCTTGAA 
GGACTTGTTA ACTGAGATAC ACCCTATACG AGGTCGCGAT GTTGGAACTT 

29951 GTCAGGCTTC CTGGATGTCA GCATCTGACT TTGGCCAGCA CCTGTCCCGC 
CAGTCCGAAG GACCTACAGT CGTAGACTGA AACCGGTCGT GGACAGGGCG 

30001 GGATTTGTTC CAGTCCAACT ACAGCGACCC ACCCTAACAG AGATGACCAA 
CCTAAACAAG GTCAGGTTGA TGTCGCTGGG TGGGATTGTC TCTACTGGTT 

30051 CACAACCAAC GCGGCCGCCG CTACCGGACT TACATCTACC ACAAATACAC 
GTGTTGGTTG CGCCGGCGGC GATGGCCTGA ATGTAGATGG TGTTTATGTG 

30101 CCCAAGTTTC TGCCTTTGTC AATAACTGGG ATAACTTGGG CATGTGGTGG 
GGGTTCAAAG ACGGAAACAG TTATTGACCC TATTGAACCC GTACACCACC 

30151 TTCTCCATAG CGCTTATGTT TGTATGCCTT ATTATTATGT GGCTCATCTG 
AAGAGGTATC GCGAATACAA ACATACGGAA TAATAATACA CCGAGTAGAC 

30201 CTOCCTAAAG CGCAAACGCG CCCGACCACC CATCTATAGT CCCATCATTG 
GACGGATTTC GCGTTTGCGC GGGCTGGTG6 GTA6ATATCA GGGTAGTAAC 

■ 30251 TGCTACACCC AAACAATGAT GGAATCCATA GATTGGACGC ACTGAAACAC 
ACGATGTGGG TTTGTTACTA CCTTAGGTAT CTAACCTGCC TGACTTTGTG 

30301 ATGTTCTTTT CTCTTACAGT ATGATTAAAT GAGACATGAT TCCTCGAGTT 
TACAAGAAAA GAGAATGTCA TACTAATTTA CTCTGTACTA AGGAGCTCAA 
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3035L TTTATATTAC l^pCCTTGT T6CGCTTTPT TGTGCGTGCT CCACAT^pC 
AAATATAATG ACTCGGAACA ACGCGAflAA;i ACAC6CACGA GGTGTA^G 

30401 TSCGGTTTCT CACATCGAAC TAGACTGCAT TCCAGCCTTC ACAGTCTATT 
ACGCCAAA6A GTGTAGCTTC ATC76ACGTA A66TCGGAAG TGTCAGATAA 

30451 T6CTTTACGG ATTTGTCACC CTCACGCTCA TCTOCAGCCT CATCACTGTG 
AC6AAATGCC TAAACAGTGG GAG7GCSAGT AGACGTCQGA GTAGTGACAC 

30501 GTCATCGCCT TTATCCAGTG CATTGACTOG GTCTGTGTGC QCTTTGCATA 
CAGTAGCGGA AATAGGTCAC GTAACTSACC CAGACACAC6 CGJJACGTAT 



30551 TCTCAGACAC CATCCCCAGT ACAGG(3^CA6 GACTATAGCT GAGCTTCTTA 
AGAGTCTGTG GTA66GGTCA TGTCCCTGTC CTGATAICGA CTCGAAGAAT 

30601 GAATTCTTTA ATTATGAAAT TTACTGTGAC TTTTCTOCTG ATTATTTGCA 
CTTAAGAAA7 TAATACTTTA AATGACACTG AAAAGACGAC TAATAAACGT 

30651 CCCTATCTGC GTTTTGTTCC CCGACCTCCA AGCCTCAAAG ACATATATCA 
GGGATAGT^G CAAAACAAGG GGCTGGA6GT TCGGAGTTTC TGTATATAGT 

30701 TGCAGATTCA CTCGTATATG GAATATTCCA AGTTOCTACA ATGAAAAAAG 
ACGTCTAAGT GAGCATATAC CTTATAAGGT TCAACGATGT TACTTTTTTC 

30751 CGATCTTTCC GAAGCCTGGT TATATGCAAT CATCTCTGTT ATGGTGTTCT 
GCTAGAAAGG CTTCGGACCA ATATACGTTA GTAGAGACAA TACCACAAGA 

30 801 GCAGTACCAT CTTAGCCCTA GCTATATATC CCTACCTTGA CATTGGCTGG 
CGTCATGGTA GAATCGGGAT CGATATATA6 GGATGGAACT GTAACCGACC 

30B51 AACGCAATAG ATGCCATGAA CCACCC»Cr TTCCCCCCGC CCGCTATGCT 
TTGCGTTATC TACG6TACTT GGT6GGTTGA AAGGGGCGCG OGCGATACGA 

30901 TCCACTGCAA CAAGTTGTTG CCGGCGGCTT TGTCCCAGCC AATCAGCCTC 
AGGTGACGTT GTTCAACAAC GGCCGCCGAA ACAGGGTC6G TTAGTC6GAG 

30951 GCCCACCTTC TCCCACCCCC ACTGAAATCA 6CTACTTTAA TCTAACAGGA 
C6GGTGGAAG AGGGTGGGGG TGACTT7AGT CGATGAAATT AGATTGTCCT 

31001 GGAGATGACT GACACCCTAG ATCTAGAAAT GGACGGAATT ATTACAGAGC 
CCTCTACTGA CTGTGG6ATC TACATCTTTA CCTOCCTTAA TAATGTCTCG 

31051 A6CGCCTGCT AGAAAGACGC AGGGCAGCGG CCGAGCAACA GCGCATGAAT 
TCGCGGACGA TCTTTCTGCG TCCCGTCGCC G6CTCGTTGT CGCGTACTTA 

31101 CAAGAGCTCC AAGACATGGT TAACT7GCAC CAGTGCAAAA GGGGTATCTT 
GTTCTCGAGG TTCTGTACCA ATTGAACGTG CTCACGTTTT CCCCATAGAA 

31151 TTGTCTCGTA AAGCAGGCCA AAGTCACCTA CGACAGTAAT ACCACCGGAC 
AACAGAGCAT TTCGTCCGGT TTCAGTGGAT GCTGTCATTA TQGTGGCCTG 

31201 ACCGCCTTA6 CTACAAGTTG CCAACCAW3C GTCAGAAATT GGTGGTCATG 
TGGCGGAATC GATGTTCAAC GGTTGGTTCG CAGTCTraAA CCACCAGTAC 

31251 GTGGGAGAAA AGCCCATTAC CATAACTCA6 CACTCGGTAG AAACCGAAGG 
CACCCTCTTT TCGGGTAATG GTATTGAGTC GT6AGCCATC TTTGGCTTCC 
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31301 CTGCATTCAC ^:^:CTTGTC AAGGACCTGA CGATCTCTGC ACCCT^^A 
GACGTAAGTS aRgGAACAG TTCCTGGACT CCTAGAGACG TGGGAmRat 

31351 AGACCCTGTG CG6TCTCAAA GATCTTATTC CCTTTAACTA ATAA AAAAA A 
TCTGGGACAC GCCAGAGTTT CTAGAATAAG GGAAATTGAT TATTTTTTTT 

31401 ATAATAAAGC ATCACTTACT TAAAATCAGT TAGCAAATTT CTCTCCAGTT 
TATTATTTCG TAGTQAAT6A ATTTTAGTCA ATCGTTTAAA GACAGGTCAA 

31451 TATTCAGCAG CACCTCCTTG CCCTCCTCCC AGCTCT6GTA TTGCAGCTTC 
ATAAGTCGTC GTGGAGGAAC GGGAGGAGGG TCGAGACCAT AACGTCGAAG 



31501 CTCCTGGCTG CAAACTTTCT CCACAATCTA AATGGAATGT CAGTTTCCTC 
GAGGACCGAC GTTTGAAAGA GGTGTTA6AT OTACCTTACA GTCAAACGAG 

31551 CTGTTCCTGT CCATCCGCAC CCACTATCTT CATGTTGTTG CACATGAAGC 
GACAAGGACA GGTAGGCGTG GGTGATAGftA CTACAACAAC GTCTACTTCG 

31601 GCGCAAGACC GTCTGAAGAT ACCTTCAACC CCGTGTATCC ATATGACACG 
CGC6TTCTGG CAGACTTCTA TGGAAGTTGG GGCACATAGG TATACTGTGC 

31651 GAAACCGGTC CTCCAACTGT GCCTTTTCTT ACTCCTCCCT OTCTATCCCC 
CTTTGGCCAG GAGGTTGACA CGGAAAAGAA TGAGGAGGGA AACATAG6GG 

31701 CAATGGGTTT CAAGAGAGTC CCCCTGGGGT ACTCTCTTT6 CGCCTATCCG 
GTTACCCAAA GTTCTCTCAG GGGGACCCCA TGAGAGAAAC GCGGATAGGC 

31751 AACCTCTAGT TACCTCCAAT GGCATGCTTG CGCTCAAAAT GGGCAACG6C 
TTGGAGATCA ATGGAGGTTA CCGTACGAAC GCGAGTTTTA CCCGTTGCCG 

31801 crCTCTCTGG ACGAGGCCGG CAACCTTACC TCCGJUAATG TAACCACTGT 
GAGAGAGACC TGCTCCGQCC GTTGGAATGG ACGGTTTTAC ATTCGTGACA 

31851 GAGCCCACCT CTCAAAAAAA CCAAGTCAAA CATAAACCTG GAAATATCTG 
CTCGGGTGGA GAGTTTTTTT GGTTCAGTTT GTATTTGGAC CTTTATAGAC 

3 19 01 CACCCCTCAC AGTTACCTCA GAAGCCTTAA CTGTGGCTGC CGCCGCACCT 
GTGGGGAGTG TCAATGGAGT CTTCG3GATT GACACCGACG GCGGCGTGGA 

31951 CTAATGGTCG CGGGCAACAC ACTCACCATG CAATCACAGG CCCCGCTAAC 
GATTACCAGC GCCCGTTGTG TGAGTGGTAC GTTAGTGTCC GGGGCGATTG 

32001 CGTGCACGAC TCCAAACTTA GCATTGCCAC CCAAGGACCC CTCACAGTGT 
GCACGTGCTG AGGTTTGAAT CGTAACGGTG GGTTCCTGGG GAGTGTCACA 

32051 CAGAAGGAAA GCTAGCCCTG CAAACATCAG GCCCCCTCAC CACCACCGAT 
GTCTTCCTTT CGATCGCGAC GTTTGTAGTC CGGGGGAGTG GTGGTGGCTA 

32101 AGCAGTACCC TTACTATCAC TGCCTCACCC CCTCTAACTA CTGCCACTGG 
TCGTCATGGG AATGATAGTG ACGGAGTGGG GGAGATTGAT GACGGT6ACC 

32151 TAGCTTGGGC ATTGACTTGA AAGAGCCCAT TTATACACAA AATG6AAAAC 
ATCGAACCCG TAACT6AACT TTCTCGGGTA AATATGTGTT TTACCTTTTG 

32201 TAGGACTAAA GTACGGGGCT CCTTTGCATG TAACAGACGA CCTAAACACT 
ATCCTGATTT CATGCCCCGA GGAAACGTAC ATTGTCTGCT GGATTTGTGA 
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32251 TTGACCGTAG Q||CTGGTCC AGGTGTGACT ATTAATA^JH:rm:c?^|&A 
AACTGGCATC HRsACCAGG TCCACACTGA TAA7TAT7AT GA&GG^PST 

32301 AACTAAAGTT ACTGGAGCCT TGGGTTTTGA TTCACAAGGC AATATGCAAC 
TTGATTTCAA TGACCTCGGA ACCCAAAACT AAGTGTTCCG TTATACGTTG 

32351 TTAATGTAGC AGGA6GACTA AGGATTGATT CTCAAAACAG ACGCCTTATA 
AATTACATCG TCCTCCTGAT TCCTAACTAA GAGTTTTGTC TGCGGAATAT 

32401 CTTGATGTTA GTTATCCGTT TGATGCTCAA AACCAACTAA ATCTAAGACT 
GAACTACAAT CAATAGGCAA ACTACGAGTT TTGGTTGATT TAGATTCTGA 



32451 AGGACAGGGC CCTCTTTTTA TAAACTCAGC CCACAACTTG GATATTAACT 
TCCTGTCCCG GGAGAAAAAT ATTTGAGTCG GGTGTTGAAC CTATAATTGA 

32501 ACAACAAAGG CCTTTACTTG TTTACAGCTT CAAACAATTC CAAAAAGCTT 
TG T TGTTTCC GGAAATGAAC AAATGTCGAA GTTTGTTAAG GTTTTTCGAA 

32551 GAGGTTAACC TAAGCACTGC CAAGGGGTTG ATGTTTGACG CTACAGCCAT 
CTCCAATTGG ATTCGTGACG GTTCCCCAAC TACAAACTGC GATGTCGGTA 

32601 AGCCATTAAT GCAGG^iGATG GGCTTGAATT TGGTTCACCT AA.TGCACCAA 
TCGGTAATTA CGTCCTCTAC CCGAACTTAA ACCAAGTGGA TTACGTGGTT 

32651 ACACAAATCC CCTCAAAACA AAAATTGGCC ATGGCCTAGA ATTTGATTCA 
TGTGTTTAGG SGAGTTTTGT TTTTAACCGG TACCGGATCT TAAACTAAGT 

32701 AACAAGGCTA TGGTTCCTAA ACTAGGAACT GGCCTTAGTT TTGACAGCAC 
TTGTTCCGAT ACCAAGGATT TGATCCTTGA CCGGAATCAA AACTGTCGTG 

32751 AGGTGCCATT ACAGTAGGAA ACAAAAATAA TGATAAGCTA ACTTTGTGGA 
TCCACGGTAA TGTCATCCTT TGTTTTTATT ACTATTCGAT TGAAACACCT 

32B01 CCACACCAGC TCCATCTCCT AACTGTAGAC TAAATGCAGA GAAAGATGCT 
GGTGTGGTCG AGGTAGAGGA TTGACATCrG ATTTACGTCT CTTTCTACGA 

32851 AAACTCACTT TGGTCTTAAC AAAATGTGGC AGTCAAATAC TTGCTACAGT 
TTTGAGTGAA ACCAGAATTG TTTTACACCG TCAGTTTATG AACGATGTCA 

32901 TTCAGTTTTG GCTGTTAAAG GCAGTTTGGC TCCAATATCT GGAACAGTTC 
AAGTCAAAAC CGACAATTTC CGTCAAACCG AGGTTATAGA CCTTGTCAAG 

32951 AAAGTGCTCA TCTTATTATA AGATTTGACG AAAATGGAGT GCTACTAAAC 
TTTCACGAGT AGAATAATAT TCTAAACTGC TTTTACCTCA CGAT6ATTTG 

33001 AATTCCTTCC TGGACCCAGA ATATTGGAAC TTTAGAAATG GAGATCTTAC 
TTAAGGAAGG ACCTGGGTCT TATAACCTTG AAATCTTTAC CTCTAGAAT6 

33051 TGAAGGCACA GCCTATACAA ACGCTGTTGG ATTTATGCCT AACCTATCAG 
ACTTCCGTGT CGGATATGTT 7GCGACAACC TAAATACGGA T7GGATAGTC 

33101 CTTATCCAAA ATCTCACGGT AAAACTGCCA AAAGTAACAT TGTCAGTCAA 
GAATAGGTTT TAGAGTGCCA TTTTGACGGT rTTCATTGTA ACAGTCAGTT 

33151 GTTTACTTAA ACGGAGACAA AACTAAACCT GTAACACTAA CCATTACACT 
CAAATGAATT TGCCTCTGTT TTGATTTGGA CATTGTGAT? GGTAATGTGA 
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33201 AAAC66TACA q^pAAACAG GAGACACA^C TCCA&GTGtA TACTCSHIt 
TTTGCCATGT CTTCTTTGTC CTCTGTGTTC AGGTTCACGT ATGAGJ^CA 

33251 CATTTTCATG OGACTGGTCT GGCCACAACT ACATTAATGA AATATTTGCC 
GTAAAAGTAC CCTGACCAGA CCGGTGTTGA TGTAATTACT TTATAAACGG 

33301 ACATCCTCTT ACACTTTTTC ATACATTGCC CAAGAATAAA GAATCGTTTG 
TGTA66AGAA TGT6AAAAAG TATGTAAC6G GTTCTTATTr CTTAGCAAAC 

33351 TGTTATGTTT CAACGTGTTT ATTTTTCAAT T6CAGAAAAT TTCAACTCAT 
ACAATACAAA GTTGCACAAA TAAAAAGTTA ACGTCTTTTA AAGTTCAGTA 



33401 TTTTCATTCA GTAGTATAGC (XCACXTACCA CATAGCTTAT ACAGATCACC 
AAAAGTAAGT CATCATATCG GGGTGGTGGT GTATCGAATA TGTCTAGTGG 

33451 GTACCTTAAT CAAACTCACA GAACCCTAGT ATTCAACCTG CCACCTCCCT 
CATGGAATTA GTTTGAGTGT CTTGQGATCA TAAGTTGGAC GGTGGAGGGA 

33501 CCCAACACAC AGAGTACACA GTCCTTTCTC CCCGGCT66C CTTAAAAAGC 
GGGTTGTGTG TCTCATGTGT CA6GAAA6AG GGGCCGACCG GAATTTTTCG 

33551 ATCATATCAT GCGTAACAGA CATATTCTTA GGTGTTATAT TCCACACGGT 
TAGTATAGTA CCCATTGTCT GTATAAGAAT CCACAATATA AGGTGTGCCA 

33601 TTCCTGTCGA GCCAAACGCT CATCAGTGAT ATTAATAAAC TCCCCGCGCA 
AAGGACAGCT CCGTTTGCGA GTAGTCACTA TAATTATTTG AGQGGCCCGT 

33651 GCTCACTTAA GTTCATGTCG CTGTCCAGCT GCTGAGCCAC AGQCTGCTGT 
CGAGTGAATT CAAGTACAGC GACAGGTCGA CGACTCGGTG TCCGACGACA 

33701 CCAACTTGCG 6TTGCTTAAC GGGCGGC6AA GGAGAAGTCC AOGCCTACAT 
GGTTGAACCC CAACCAATTG CCCGCCGCTT CCTCTTCAGG TGCGGATGTA 

33751 GGGGGTAGAG TCATAATCGT GCATCA6GAT AGGGCGGTGG TQCTGCAGCA 
CCCCCATCTC AGTATTAGCA CGTACTCCTA TCCCiSCCACC ACGACGTCGT 

33801 GCGCGCGAAT AAACTGCTGC CGCCGCC6CT CCGTCCTGCA GGAATACAAC 
CGCGCGCTTA TTTGACGACG CCGQCGGCGA GGCAG6ACGT CCTTATGTTG 

33651 ATGGCAGTGG TCTCCTCAGC GATGATTCGC ACCGCCCGCA GCATAAGGCG 
TACCGTCACC AGAGGAOTCG CTACTAAGCG TGGCGGGCGT CGTATTCCGC 

33901 CCTTGTCCTC C6GGCACAGC AGCGCACCCT GATCTCACTT AAATCAGCAC 
GGAACAGGAG GCCC6TGTCG TCGCGTGGC3A CTAGAGTGAA TTTAGTCGTG 

33951 AGTAACTGCA GCACAGCACC ACAATATTCT TCAA2ATCCC ACAGTGCAA6 
TCATTGACGT CGTGTCGTGG TGTTATAACA AGTTTTAGG6 TGTCACGTTC 

34001 GCGCTGTATC CAAAGCTCAT G6CGGGGACC ACAGAACCCA CGTGGCCATC 
CGCGACATAG GTTTCGAGTA CCGCCCCTGC TGTCTTGG6T GCACCGGTAG 

34051 ATACCACAAG CGCAGGTAGA TTAAGTGQCG ACCCCTCATA AACACGCTGG 
TATGGTGTTC GCGTCCATCT AATTCACCGC TGGGGAGTAT TTGTGCGACC 

34101 ACATAAACAT TACXTTCTTTT GGCATGTTGT AATTCACCAC CTCCCGGTAC 
TGTATTTGTA ATGGAGAAAA CCGTACAACA TTAAGTCGTG GAGGGCCATG 
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34X51 CATATAAACC 
GTATATTTGG 




ATTAAA CATGGCGCCA TCCACCACCA-TXICTJ 
TAATTT GTACCQCOOT AGGTGGTGGT AGGA' 




34201 GCTGGCCAAA ACCTGCCCGC CGGCTATACA CTGCAGGGAA CCGGGACTGG 
CGACCGGTTT TGGACGGGCG GCCGATATGT GACGTCCCTT GGCCCTGACC 

34251 AACAATGACA GTGGAGAGCC CAGGACTC6T AACCATGGAT CATCATGCTC 
TTGTTACTGT CACCTCTCGG GTCCTGAGCA TTGGTACCTA GTAGTACGAG 

34301 GTCATGATAT CAATGTTGGC ACAACACAGG CACACGTGCA TACACTTCCT 
CAGTACTATA GTTACAACCG TGTTGTGTCC GTGTGGAC6T ATGTGAAGGA 



34351 CAGGATTACA AGCTCCTCCC GCGTTAGAAC CATATCCCAG GGAACAACCC 
GTCCTAATGT TCGAGGAGGG CGCAATCTTG GTATAGGGrc CCITGTTGGG 

34401 ATTCCTGAAT CAGCGTAAAT CCCACACTGC AGGGAAGACC TCGCACGTAA 
TAAGGACTTA 6TCGCATTTA GGGTGTGACG TCCCTTCT6G AGCGTGCATT 

34451 CTCACGTTGT GCATTGTCAA A6TGTTACAT TCGGGCAGCA GCGGATGATC 
GAGTGCAACA CGTAACAGTT TCACAATGTA A6CCCGTC6T CGCCTACTAG 

34501 CTCCAGTATG GTAGCGCGGG TTTCTGTC!PC AAAAGGAGGT AGACGATCCC 
GAGGTCATAC CATCGCGCCC AAAGACAGAG TTTTCCTCCA TCTGCTAGGG 

34551 TACTGTACGG AGTGCGCCGA GACAACCGAG ATCGTGTTGG TCGTAGTGTC 
A.TGACAT6CC TCACGCGGCT CTGTTGGCTC TAGCACAACC AGCATCACAG 

34601 ATGCCAAATG GAACGCCGGA CGTAGTCATA TTTCCTGAAG CAAAACCAGG 
TACGGTTTAC CTTGCGGCCT GCATCAGTAT AAAGGACTTC GTTTTGGTCC 

- -34651 TGCGGGCGTG ACAAACAGAT- CTGCGTCTCC -GGTCTCGCCG CTTAGATCGC 
ACGCCCGCAC TGTTTGTCTA GACGCAGAGG CCA6AGCGGC GAATCTAGCG 

34701 TCTGTGTAGT AGTTGTAGTA TATCCACTCT CTCAAAGCAT CCAGGCGCCC 
AGACACATCA TCAACATCAT ATAGGT6AGA eAGTTTCGTA GGTCCGCGGG 

34751 CCTG6CTTCG GGTTCTATGT AAACTCCTTC ATGCGCCGCT GCCCTGATAA 
GGACCGAAGC CCAAGATACA TTTGAGGAAG TACGCGGCGA CGGGACTATT 

34801 CATCCACCAC CGCAGAATAA GCCACACCCA GCCAACCTAC ACATTCGTTC 
GTAGGTGGTG GCGTCTTATT CGGTGTGGGT CGGTTGGATG TGTAAGCAAG 

34 851 TGCGAGTCAC ACACGGGAGG AGCGG6AAGA GCTGGAAGAA CCATGTTTTT 
ACGCTCA6TG TGTGCCCTCC TCGCCCTTCT CGACCTTCTT GGTACAAAAA 

• 34901 TTTTTTATTC CAAAAGATTA TCCAAAACCT CAAAATGAAG ATCTATTAAG 
AAAAAATAAG GOTTTCTAAT AGGTTTTGGA GTTTTACTTC TAGATAATTC 

34951 TGAACGCGCT CCCCTCCGGT GGCGTGGTCA AACTCTACAG CCAAAGAACA 
ACTTGCGCGA GGGGAGGCCA CCGCACCAGT TTGAGATGTC GGTTTCTT6T 

35001 GATAATGGCA TTTGTAAGAT * GTTGCACAAT GGCTTCCAAA AGGCAAACGG 
CTATTACCGT AAACATTCTA CAACGrGTTA CCGAAGGTTT TCCGTTT6CC 

35051 CCCTCACGTC CAACTGGACG TAAAGGCTAA ACCCTTCAGG GTGAATCTCC 
GSGAGTGCAG GTTCACCT6C ATTTCCGATT TGGGAA6TCC CACTTAGAGG 




91/144 



wo 02/22080 



PCT/USOl/28861 



35101 



TCTATAAACA 




.QCACC 



TTCAACCATG 



CCCAAATAAT TCTCA' 



AGATATTTGT aJBIgTCGTGG AAGTTG3TAC GGGTTTATTA AGACTAgRGC 

35151 CCACCTTCTC AATATATCTC TAAGCAAATC CCQAATATTA AGTCCGGCCA 
GGTGGAAGA6 TTATATAGAG ATTCGTTTAG GGCTTATAAT TCAGGCCGGT 

35201 TTGTAAAAAT CTGCTCCAGA GCGCCCTCCA CCTTCAGCCT CAAGCAGCGA 
A;iCATTTrTA GACGAGGTCT C6CGGGAGGT OG^GTCGGA GTTCGTCGCT 

35251 ATCATGATTG CAAAAATTCA GGTTCCTCAC AGACCTGTAT AAGATTCAAA 
TAGTACXAAC GTTTTTAA6T CCAAGGAGTG TCTGGACATA TTCTAAGTTT 



35301 AGCGGAACAT TAACAAAAAT ACCGCGATCC C6TAGGTCCC TTCGCAGGGC 
TCGCCTTGTA ATTGTTTTTA TGGCGCTAGG GCATCCAGGG AAGCGTCCC6 

35351 CAGCTGAACA TAATCGTGCA GGTCTQCACS GACCAGCGCG GCCACTTCCC 
GTCGACTTGT ATTAGCACGT CCAGACGTGC CTGGTCGCGC CGGTGAAGGG 

35401 CGCCAGGAAC CATGACAAAA GAACCCACAC TGATTATGAC ACGCATACTC 
GCGGTCCTTG GTACTGTTTT CTTGGGTGTG ACTAATACTG T6CGTATGAG 

35451 GGAGCTATGC TAACCAGCGT AGCCCCGATG TAAGCTTGTT GCATGGGCTGG 
CCTCGATACG ATTGGTCGCA TCGGGGCTAC ATTCGAACAA CGTACCCGCC 

35501 CGATATAAAA TGCAAGGTGC TGCTCAAAAA ATCAGGCAAA GCCTCGCGCA 
GCTATATTTT ACGTTCCACG ACGAGTTTTT TAGTCCGTTT CGGAGCGCGT 

35551 AAAAAGAAAG CACATCGTAG TCATGCTCAT GCAGATAAAG GCAGGTAAGC 
TTTTTCTTTC 6TGTAGCATC AGTACGAGTA CGTCTATTTC CGTCCATTCG 

35601 TCCGGAACCA CCACAGAAAA AGACACCATT TTTCTCTCAA ACATGTCTGC 
AGGCCTTGGT GGTGTCTTTT TCT6TOGTAA AAAGAGA6TT TGTACAGACG 

35551 GGGTTTCTGC ATAAACACAA AATAAAATAA CAAAAAAACA TTTAAACATT 
CCCAAAGACG TATTTGTGTT TTATTTTATT GTTTTTrPGT AAATTTGTAA 

35701 AGAAGCCTGT CTTACAACAG GAAAAACAAC CCTTATAAGC ATAAGACGQA 
TCTTCGGACA GAATGTTGTC CTTTTTCTTG G6AATATTCG TATTCTGCCT 

35751 CTACGGCCAT GCCGGCGTGA CCGTAAAAAA ACTG6TCACC GTGATTAAAft 
GATGCCGGTA CGGCCGCACT GGCATTTTTT TGACCAGTGG CACTAATTTT 

35801 AGCACCACCG ACAGCTCCTC GGTCATGTCC GGAGTCATAA TGTAAGACTC 
TC6TGGTGGC TGTCGAGGAG CCAGTACAGG CCTCAGTATT ACATTCTGAG 

35851 GGTAAACACA TCAGGTTGAT TCACATCGGT CAGTGCTAAA AAGCGACCGA 
CCATTTGTGT AGTCCAACTA AGTGTAGCCA GTCACGATTT TTCGCT6GCT 

35901 AATAGCCCGG GGGAATACAT ACCCGCAGGC GTAGAGACAA CATTACAGCC 
TTATCGGGCC CCCTTATGTA TGGGCGTCCG CATCTCTGTT GTAATGTCGG 

35951 CCCATAGGAG GTATAACAAA ATTAATAGGA GAGAAAAACA CATAAACACC 
GGGTATCCTC CATATTGTTT TAATTATCCT CTCTTTTTGT GTATTTGTGG 

36001 TGAAAAACCC TCCTGCCTAG GCAAAATAGC ACCCTCCCGC TCCAGAACAA 
ACTTTTTGG6 AGGACGGATC CGTTTTATCG TGGGAGGGCG AGGTCTTGTT 
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36051 



CATACAGCGC 




tACAGCG GCAGCCATAA 




GTATGTCGCG AAGGTGTCGC CGTCGCTATT GTCAGTCGGA ATGGTCATTT 

361D1 AAAGAAAACC TATTAAAAAA ACACCACTCG ACACGGCACC AGCTCAATCA 
TTTCTTTTGG ATAATTTTTT^ TGTGGTGAGC TGTGCCGTGG TCGAGTOAGT 

36151 GTCACA6TGT AAAAAAGGGC CAAGTGCAGA GCGAGTATAT ATAGGACTAA 
CAGTGTCACA TTTTTTCCCG GTTCACGTCT CGCTCATATA TATCCTGATT 

36201 AAAATGACGT AACGGTTAAA GTCCACAAAA AACACCCAGA AAACCGCACG 
TTTTACTGCA TOGCCAATTT CAGGTGTTTT TTGTGGGTCT TTPGGCGTGC 



36251 CGAACCTACG CCCAGAAACG AAAGCCAAAA AACCCACAAC TTCCTCAAAT 
GCTTGGATGC GGGTCTTTGC TTTCGGTTTT TTGGGTGTTG AA6GAGTTTA 

36301 CGTCACTTCC GTTTTCCCAC 6TTACGTCAC TTCCCATTTT AAGAAAACTA 
GCAGTGAAGG CAAAAGGGTG CAATGCAGTG AAGGGTAAAA TTCTTTTGAT 

36351 CAATTCCCAA CACATACAAG TTACTCCGCC CTAAAACCTA CGTCACXTCGC 
GTTAAGGGTT GTGTATGTTC AATGAGGCGG 6ATTTTGGAT GCAGTG6GCG 

36401 CCCGTTCCCA CGCCCCGCGC CACGTCACAA ACTCCACCCC CTCATTATCA 
GGGCAAGGGT GCGGGGCGCG GTGCAGTGTT TGAGGTGGGG GAGTAATAGT 



36451 TATTGGCTTC AATCCAAAAT AAGGTATATT ATTGATGATG TTAATTAAGA 
ATAACCGAAG TTAGGTTTTA TTCCATATAA TAACTACTAC AATTAATTCT 

36501 A7TCGGATCT GC6ACGCGAG CTTCCCCATT ATGATTCTTC 

TAAGCCTAGA CGCTGC6CTC CGACCTACCG' GAAGGGGTAA TACTAAGAAG" 

36551 TCGCTTCCGG CGGCATCGGG ATGCCCGCGT TGCAGGCCAT GCTGTCCAGG 
AGCGAAGGCC GCCGTAGCCC TACGGGCGCA ACGTCCGGTA CGACAGGTCC 

36601 CAGGTAGATG ACGACCATCA GGGACAGCTT CAAGGCCAGC AAAAGGCCAG 
GTCCATCTAC TGCTG6TAGT CCCTGTCGAA GTTCCGGTCG TTTTCCGGTC 

36651 GAACCGTAAA AAGGCCGCGT TGCTGGCGTT TTTCCATAGG CTCCGCCCCC 
CTTGGCATTT TTCCGGCGCA ACGACCGCAA AAAGGTATCC GAGGCGGGGG 

36701 CTGACGAGCA ?CACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACCCG 
GACTGCTCGT AGTGTTTTTA GCTGC6AGTT CAGTCTCCAC COCTTTGGGC 

36751 ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAA6CT CCCTCGTGCG 
TGTCCTGATA TTTCTATGGT CCGCAAAGGG GGACCTTCGA GGGAGCACGC 

36801 CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
GAGAGGACAA 66C7GGGACG GCGAATGGCC TATGGACAGG CGGAAAGAGG 

3 6851 CTTCGGGAAG CGTGGCGCTT TCTCATAGCT CACGCTGTAG GTATCTCAGT 
GAAGCCCTTC GCACCGCGAA AGAGTATCGA GTGCGACATC CATAGAGTCA 

36^01 TCGGTGTAGG TCGTTCGCTC CAAGCTOGGC TGTGTGCACG AACCCCCCGT 
AGCCACATCC AGCAAGCGAG GTTCGACCCG ACACACGTGC TTGGGGGGCA 
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36951 TCAGCCCGAC Q^pGCGCCT TATCCGGTAA CTATCGTCTt'^ GA6TCQj«CC 
AGTCGGGCTG gIRcQCGGA ATAGGCCATT GATAGCAGAA CTCAGcJWgg 

37001 CGGTAAGACA CGACTTATCG CCACTGGCAG CAGCCACTGG TAACAGGATT 
QCCATTCTGT 6CT6AATAGC GGTGACCGTC GTCGGTGACC ATTGTCCTAA 

37051 ' AGCAGAGCGA GGTATGTAGG CCGTGCTACA GAGTTCTTGA AGTGGT6GCC 
TCGTCTCGCT CCATACATCC GCCACGATGT CTCAAGAACT TCACCACCGG 

37101 TAACTACGGC TACACTAGAA GGACAGTATT TG6TATCT5C GCTCTGCTGA 
ATTGATGCCG ATGTGATCTT CCTGTCATAA ACCATAGACG CGAGACGACT 

37151 AGCCAGTTAC CTTCGGAAAA AGA6TTGGTA GCTCTTGATC CGGCAAACAA 
TCGGTCAATG GAAGCCTTTT TCTCAACCAT CGAGAACTAG GCCGTTTGTT 

37201 ACCACCGCTG 6TAGCGGTGG TTTTTTTGTT TGCAAGCAQC AGATTACGCG 
TGGTGGCGAC CATCGCCACC AAAAAAACAA ACGTTC6TCG TCTAATGCGC 

37251 CAGAAAAAAA GGATCTCAAG AAGATCCTTT GATCTTTTCT ACG6GGTCTG 
GTCTTTTTTT CCTAGAGTTC TTCTACGAAA CTAGAAAAGA TGCCCCAGAC 

37301 ACGCTCAGTG GAACGAAAAC TCACGTTAAG GGATTTTGGT CATGAGATTA 
TGCGAGTCAC CTTGCTTTTG AGTOCAATTC CCTAAAACCA 6TACTCTAAT 

37351 TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATCAATCTA AAGTATATAT 
AGTTTTTCCT AGAAGTGGAT OTAGGAAAAT TTAGTTAGAT TTCATATATA 

37401 GAGTAAACTT GGTCTGACAG TTACCAATGC TTAATCAGTG AGGCACCTAT 
CTCATTTGAA CCAGACTGTC AATGGTTACG AATTAGTCAC TCCGT6GATA 

37451 CTCAGCGATC 'TGTCTATTTC GTICATCCAT A6TTGCCTGA CTCCCCGTCG 
GAGTCGCTAG ACAGATAAAG CAAGTAGGTA TCAACGGACT GAGGGGCAGC 

37501 TGTAGATAAC TACGATACGG GAGG6CTTAC CATCTGGCCC CAGTGCTCCA 
ACATCTAT-TG ATGCTATGCC CTCCCGAATG GTAGACCGGG GTCACGACGT 

37551 ATGATACCGC GAGACCCACG CTCACCGGCT CCAGATTTAT CAGCAATAAA 
TACTATGGCG CTCTGGGTGC GAGTGGCCGA GGTCTAAATA GTCGTTATTT 

37 601 CCAGCCAGCC GGAAGGGCCG AGCGCAGAA6 TGGTCCTGCA ACTTTATCCG 
GGTCGGTCGG CCTTCCCGGC TCGCGTCTTC ACCAGGACGT TGAAATAGGC 

37651 CCTCCATCCA GTCTATTAAT TGTTGCCGGG AAGCTAGAGT AAGTAGTTCG 
GGAGGTAGGT CAGATAATTA ACAACGGCCC TTC6ATCTCA TTCATCAAGC 

37701 CCAGTTAATA GTTTGCGCAA CGTTGTTGCC ATTGCTACAG GCATCGTGGT 
GGTCAATTAT CAAACGCGTT GCAACAACGG TAACGATGTC CGTAGCACCA 

37751 GTCACGCTCG TCGTTTGGTA TGGCTTCATT CAGCTCCGCT TCCCAAC6AT 
CAGTGCGAGC AGCAAACCAT ACCGAAGTAA GTCGAGGCCA AGGGTTGCTA 

37801 CAAGGCGAGT TACATGATCC CCCATGTTGT GCAAAAAAGC GGTTAGCTCC 
6TTCC6CTCA ATGTACTAGG GGGTAC3ACA CGTTTTTTCG CCAATCGAGG 

37B51 TTCGGTCCTC CGATCGTTGT CAGAAGTAAG TTGGCCGCAC TGTTATCACT 
AAGCCAGGAG QCTAGCAACA GTCTTCATTC AACCGGCGTC ACAATAGTGA 
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37901 CJITGGTTATG (ApCACTGC 

37951 GATGCTTTTC TGTGACTGGT 
CTACGAAAAG ACACTGACCA 



ATAATTCTCT TACTGTCATG'^CATCqB^ ' 
TATTAAGAGA ATGACA6TAC G6TAG(^nT 

GAGTACTCAA CX:AAGTCATT CTGAGAATAG 
CTCATGAGTT GGTTCAGTAA GACTCTTATC 



38001 TGTATGC6GC GACCGAGTTG CTCTTGCCCG GCGTCAACAC 6G6ATAATAC 
ACATACGCCG CTGGCTCAAC GAGAACGGGC CGCAGTTGTG CCCTATTATG 

38051 CGCGCCACAT AGCAGAACTT TAAAAGTGCT CATCATTGGA AAACGTTCTT 
GCGCGGTGTA TCQTCTTGAA ATTTTCACGA GTAGTAACCT TTTGCAAGAA 

38101 C6GGGCGAAA ACTCTCAA6G ATCTTACCGC TGTTGAGATC CA6TTCGAT6 
6CCCCGCTTT TGAGAGTTCC TAGAATGGCG ACAACTCTAS GTCAAGCTAC 

3B151 TAACCCACTC GTGCACCCAA CTGATCTTCA GCATCTTTTA CTTTCACCAG 
ATTGGGTGAG CAC6TG6GTT GACTAGAAGT CGTAGAAAAT 6AAAG7GGTC 

3B2G1 C6TTTCTGG6 TGAGCAAAAA CAGGAAGGCA AAATGCCGCA AAAAAGGGAA 
0CAAA6ACCC ACTCGTTTTT GTCCTTCCGT TTTACGGCGT TTTTTCCCTT 

■ 

38251 TAAGGGCGAC ACGGAAATGT TGAATACTCA TACTCTTCCT TTTTCAATAT 
ATTCCCGCTG TGCCTTTACA ACTTATGAGT ATGAGAAGGA AAAAGTTATA 

38301 TATTGAAGCA TTTATCAGGG TTATTGTCTC ATGAGCGGAT ACATATTTGA 
ATAACTTCGT AAATAGTCCC AATAACAGAG TACTCGCCTA TGTATAAACT 

3835 1 ATGTATTTAG AAAAATAAAC AAATAGGGST TCCGCGCACA TTTCCCCGAA 
TACATAAATC TTTTTATTT6 TTTATCCCCA AGGCGCGTGT AAAGGGGCTT 

38401 AAGTGCCACC TGACGTCTAA GAAACCATTA TTATCATGAC ATTAACCTAT 
TTCACGGTGG ACTOCAGATT CTOTCQTAAT-AATAGTACTG TAATTGGATA 

3 84 51 AJU^AATAGGC GTATCACGAG GCCCTTTCGT CTTCAAGAAT TGGATCCGAA 
TTTTTATCCG CATAGTSCTC CGQGAAAGCA 6AA6TTCTTA ACCTAGGCTT 



Pad 



38501 TTCTTAATTT CTrAATTAA (SEQ ID NO: 32) 
AAGAATTAAA GAATTAATT (SBQ ID NO:33) 



1^ 



95/144 



wo 02/22080 



PCT/USOl/28861 



MRKAd5iicfMER1063 
(MRKAdS Pre-Adenoviral Vector Containing the G2A»LLA nef Coding Region) 



1 CATCATCAAT AATATACCTT ATTTTGC3ATT GAAGCCAATA TGATAATGAG 
GTAGTAGTTA TTATATOGAA TAAAACCTAA CTTCGGTTAT ACTATTACTC 

51 GQGGTGGAGT TrGTQACGTG 6CGCGGGGCG TGGGAACGGG GCCCGTGACC 
CCCCACCTCA AACACTGCAC CGCGCCXTGC ACCCTTGCCC CGCCCACTGC 

101 TAGTAGTGTO 6CGGAAGTGT GATGTOXSCAA GTGTGGCGGA ACACATGTAA 
ATCATCACAC CSCCTTCACA CTACAACGTT CACACCGCCT TGTGTACATT 

151 6CGACGGAT0 T3GCAAAAGT GACGTTTTTG GTGTGCGCCG GTGTACACAG 
C6CTGCCTAC ACCGTTTTCA CTCCAAftAAC CACACGCGGC CACATGTGTC 

201 CAAGTGACAA TTTTCQCGCG GTTTTAGGCG GATGTTGTAG TAAATTTGGG 
CTTCACTGTT AAAAGOGCGC CAAAATCCGC CTACAACATC ATTTAAACCC 

251 CGTAACCGAG TAAGATTTGG CCATTTTCGC GGGAAAACTG AATAAGAGGA 
GCATTGGCTC ATTCTAAACC GGTAAAAGCG CCCTTTTGAC TTATTCTCCT 

301 AGTGAAATCT GAACAATTTT GTGTTACTCA TAGCGCGTAA TATTTGTCTA 
TCACTTTAGA CTTATTAAAA CACAATGA6T ATCGCGCATT. ATAAACAGAT 

351 6GGCCGCGGG GACTTTGACC 6TTTACGTGG A6ACTCGCCC AGGTGTTTTT 
CCC6GCGCCC CTGAAACTGG CAAAT6CACC TCTGAGCGGG TCCACAAAAA 

401 CTCA6GTGTT TTCCGCGTTC C6GGTCAAAG TTGGCGTTTT ATTATTATAG 
6AGTCCACAA AAGGCGCAAG GCCCAGTTTC AACCGCAAAA TAATAATATC 

451 GCGGCCGCGA TCCATTGCAT ACGTTGTATC CATATCATAA TATGTACATT 
CGCCGGCGCT AGCTAACGTA TGCAACATAG GTATAGTATT ATACATGTAA 

« 

501 TATATTGGCT CATGTCCAAC ATTACCGCCA TGlTCACATr GATTATTGAC 
ATATAACCGA GTACAGGTTG TAATGGCGGT ACAACTGTAA CTAATAACTG 

551 TAGTTATTAA TAGTAATCAA TTACGG06TC ATTAGTrCAT AGCCCATATA 
ATCAATAATT ATCATTAGTT AArtSCCCCAG TAATCAAGTA TCGGGTATAT 

601 TGGAGTTGC6 CGTTACATAA CTTACG6TAA ATGGCCCGCC TGGCTGACCG 
ACCTCAAGGC GCAATGTATT GAATGCCATT TACCGGGCGG ACCGACTGGC 

651 CCCAACGACC CCCGCCCATT GACGTCAATA ATGACGTATG TTCCCATAGT 
GGGTTGCTGG GGGCGGGTAA CTGCAGTOAT TACTGCATAC AAGGGTATCA 

701 AACGCCAATA GGGACTTTCC ATTGACGTCA ATGGCTQGAG TATTTACGGT 
TTGCGGTTAT CCCTGAAAGG TAACTGCAGT TACCCACCTC ATAAATGCCA. 

751 AAACTGCCCA CTTGGCAGTA CATCAAGTGT ATCATATGCC AAGTACGCCC 
TTTCACGGGT GAACCGTCAT GTAGTTCACA lAGTATACGG TTCATGCGGG 

801 CCTATT6AC6 TCAATGACGG TAAATGGCCC GCCTGGCATT AT6CCCAGTA 
GGATAACTGC A6TTACTGCC ATTTACCGGG CGGACCGTAA TACG60TCAT 
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851 



CATGACCTTA 



.CTTTC 



CTACTTGGCA GTACATCTAC 



GTATTj 



GTACTG6AAT aRCTGAAAG GATGAACCGT CAT6TAGA7G CATAATRGT 

901 TCGCTATTAC CATGGTGATG CGGTTTTGGC AGTACATCAA TGGGCGTGGA 
AGCGATAATG GTACCACTAC GCCAAAWTCG TCATGTAGTT ACCCGCACCT 

S51 TAGCX3GTTTG ACTCAC6GGG ATTTCCAAGT CTCCACCCCA TTGACGTCAA 
ATCGCCAAAC TGAGTGCCCC TA^^AGGTTCA GI^GGTGGGGT AACTGCAGTT 

1001 TGGG^GTTTG TTTTGGCACC AAAATCAACG GGACTTTCCA MATGTCGTA 
ACCCTCAAAC AAAACCGTGG TTTTAGITGC CCTGAAAGGT TTTACAGCAT 

1051 ACA^^TCCGC CCCATTGAC6 CAAATGG6CG GTAGGCGTGT AC6GT6GGAG 
TGTTGA6GCG 6GGTAACTGC GTTTACCCGC CATCCGCACA TGCCACCCTC 

1101 GTCTATATAA GCAGAGCTC6 TTTAGTGAAC CGTCAGATCG CCTGGAGACG 
CAGATATATT CGTCTCGA6C AAATCACTTG GCAGTCTAGC GGACCTCTGC 

1151 CCATCCACGC TGTTTTGACC TCCATAGAAG ACACCGGGAC CGATCCAGCC 
GGTAGGTGCG ACAAAACTGG AGGTATCTTC TGTGGCCCTG GCTAGGTCGG 

1201 TCCGCGGCCG OGAACOGTGC ATTGGAACGC G6ATTCCCCG TGCCAA6AGT 
AGGCGCCGGC CCTTGCCACG TAACCTTGCG CCTAAGGGGC ACGGTTCTCA 

1251 GAGATCTGCC ACCATGGCCG GCAAGK3GTC CAAGAGGTCC GTGCCCGGCT 
CTCTAGACGG TGCTACCGGC CGTTCACCAG GTTCTCCAGG CACGGGCCGA 

1301 6GTCCACCGT GAGGGA6AGG ATGAGGAGGG CCGAGCCCGC CGCCGACAGG 
CCAGGTGGCA CTCCCTCTCC TACTCCTCCC GGCTCGGGCG GCGGCTGTCC 

1351 CT6AGGAGGA CCGAGCCCGC CGCAGTGGGC GTGGGCGCCG TGTCCAGGGA 
CACTCCTCCT GGCTCGGGCG GCGTCACCCG CACCCGCGGC ACAGGTCCCT 

1401 CCTGGAGAAG CACGGC6CCA TCACCTCCTC CAACACCGCC GCCACCAACG 
GGACCTCTTC 6TGCCGCGGT AGTGGAG6AG GTTGTGGCGG CGGTGGTTGC 

1451 CC6ACTGCGC CTGGCTGGAG GCCCAGGAGG ACGAGGAGGT GGGCTTCCCC 
GGCTGACGCG GACCGACCTC CGGGTCCTCC TGCTCCTCCA CCCGAAGGGG 

1501 GTGAGGCCCC AGGTGCCCCT GAGGCCCATG ACCTACAASG GCGCCGTGGA 
CACTCCGGGG TCCAC66GGA CTCCGGGTAC TGGATGTTCC CGCOGCACCT 

1551 CCTGTCCCAC TTCCTGAAGG AGAAGGGCGG CCTGGAGGGC CTGATCCACT 
GGACAGGGT6 AAGGACTTCC TCTTCCCGCC GGACCTCCCG GACTAGGTGA 

1601 CCCAGAA6AG QCAGGACATC CTQGACCTGT 06GTGTACCA CACCCAGGGC 
OGGTCTTCTC CGTCCT6TA6 GACCT6GACA CCCACATGGT GTGGGTCCCG 

1651 TACTTCCCCG ACTGGCAGAA CTACACCCCC GGCCCCGGCA TCAGGTTCCC 
ATGAAGGG6C T6ACCGTCTT GATGTGGGGG CCOGGGCCGT ASTCCAAGGG 

1701 CCTGACCTTC GGCTGGTGCT TCAAGCtGGT GCCCGTGGAG CCCGAGAAGG 
GGACTGGAAG CCGACCACGA AGTTC6ACCA C6GGCACCTC GGGCTCTTCC 

1751 TGGAGGAG6C CAACGACCCC CMIGAACAACT QCSCCGCCCA CCCCATGTCC 
ACCTCCTCCG GTTGCTCCCG CTCTTGTTGA CGCGGCGGGT GGGGTACAGG 
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1801 CAGCACG6CA '^^^OGACCC CGAGAAGGAG GTGCTGGAGT GGAGGOflKA 
GTCGTGCCGT MCTCCTGGG GCTCTTCCTC CACGACCTCA CCTCCmRt 

1851 CTCCAAGCTO GCCTTCCACC ACGTGGCCAG GGAGCTGCAC CCCGAGTACT 
GAGGTTCGAC CGGAAGGTGG TGCACOGGTC CCTCGACGTC GGGCTCAT6A 

1901 ACAAGGACTG CTAAAGCCCG GGCAGATCTG CTGTOCCTTC TAGTTGCCAG ' 
TGTTCCTGAC GATTTCGGGC CCGTCTAGAC GACACGGAAG ATCAACGGTC 

1951 CCATCTGTTG TTTGCCCCTC CCCCGTGCCT TCCTTGACCC TGGAAGGTGC 
GGTAGACAAC AAACGGGGAG GGGGCACGGA AGGAACTGGG ACCTTCCACG 

2001 CACTCCCACT GTCCTTTCCT AATAAAATGA GGAAATTGCA TCGCATT6TC 
GTGAGG6TGA CAGGAAAGGA TTATTTTACT CCTTTAACGT AGC6TAACAG 

2051 TGAGTAGGTG TCATTCTATT CTGGGQGGTG GGGTGGGGCA GGACAGCAAG 
ACTCATCCAC AGTAAGATAA GACCCCCCAC CCCACCCCGT CCTGTCGTTC 

r 

2101 GGGGAG6ATT GGGAAGACAA TAGCAGGCAT GCTGGGGATG CGGTGG6CTC 
CCCCTCCTAA CCCTTCTGTT ATCGTCCGTA CGACCCCTAC GCCACCCGAG 

2151 TATGGCCGAT CGGCGCGCCG TACTGAAATG TGTGGGCGT6 GCTTAAGGGT 
ATACC6GCTA GCCGCGCGGC ATGACTTTAC ACACCCQCAC CGAATTCCCA 

2201 GGGAAAGAAT ATATAAGGTG GGGGTCTTAT GTAGTTTTGT ATCTGTTTTG 
CCCTTTCTTA TATATTCCAC CCCCAGAATA CATCAAAACA TAGACAAAAC 

2251 avGCAGCCGC CGCCGCCATG AGCACCAACT CGTTTGATG6 AAGCATTGTG 
GTCGTCGGCG QCGGCG6TAC TCGTGGTT6A GCAAACTACC TTCGTAACAC 

2301 A3CTCATATT TGACAACGCG CATGCCCCCA TGGGCCGGGG TGCGTCAGAA 
TCGA6TATAA ACTGTTGCGC GTACGGGGGT ACCCGGCCCC ACGCAGTCTT 

2351 TGTGATGGGC TCCAGCATTG ATGGTCGCCC CGTCCTGCCC GCAAACTCTA 
ACACIACCCG AGGTCGTAAC TACCAGCGGG GCAGGACGGG CGTTTGAGAT 

2401 CTACCTTGAC CTACGAGACC GTGTCTGGAA CGCCGTTGGA GACTGCAGCC 
GATGGAACTG 6ATGCTCTGG CACAGACCTT GCGGCAACCT CTGACGTCGG 

2451 TCCGCCGCCG CTTCAGCCGC TGCAGCCACC GCCCGCGGGA TTGTGACTGA 
AGGCGGCGGC GAAGTCGGCG ACGTCGGTGG CGGGCGCCCT AACACTGACT 

2501 CTTTGCTTTC CTGAGCCCGC TTGCAAACAG TGCAGCTTCC CGTTCATCCG 
GAAACGAAAG GACTCGGGCG AACGTTTGTC AC6TCGAAGG GCAAGTAGGC 

2551 CCCGCGATGA CAAGTTGACG GCTCTTTTGG CACAATTGGA TTCTTTGACC 
GGGCGCTACT GfTTCAACTGC CGAGAAAACC GTGTTAACCT AAGAAACTGG 

2601 CGGGAACTTA ATGTCGTTTC TCAGCAGCTG TTGGATCTGC GCCAGCAGGT 
GCCCTTGAAT TACAGCAAA6 AGTCGTCGAC AACCTA6ACG CGGTCGTCCA 

2651 TTCTGCCXTTG AAGGCTTCCT CCCCTCCCAA TGCGGTTTAA AACATAAATA 
AAGACGG6AC TTCC6AAGGA GGG6AGGGTT ACGCCAAATT TTGTATTTAT 

2701 AAAAACCAGA CTCTQTTTGG ATTTGGATCA AGCAAGTGTC TTGCTGTCTT 
TTTTT6GTCT GAGACAAACC TAAACCTAGT TCGTTCACAG AACGACAGAA 
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2751 

2801 

2851 

2901 

2951 

3001 

3051 

3101 

3151 

3201 

3251 

3301 

3351 

3401 

3451 

3501 

3551 

3601 

3651 



TATTTAGGGG T9nGC6C6C GCGGTA6GCC CGG6ACCAGC GGTCTCSRC 
ATAAATCCCC A&AACGCGCG CGCCATCCGG GCCCTGGTCG CCAGAGCCAG 

GTTGAGGGTC CTGTGTATTT TTtCCAGGAC GTGGTAAA6G TGACTCTGGA 
CAACTCCCAG GACACATAAA A^kAGGTCCTG CACCATTTCC ACTGAGACCT 

TGTTCAGATA CATGG6CATA AGCCCGTCTC TGG6GTGGAG STAGCACCAC 
ACAAGTCTAT GTACCC6TAT TCGG6CAGAG ACCCCACCTC CZATCGTGGTG 

TOCAGAGCTT CATGCT6CGG GGTGGT6TT6. TAGA7GATCC AGTCGTAGCA 
ACGTCTCGAA GTACGACGCC CCftCCACAAC ATCTACTAG6 TCAGCATCGT 

GGAGCGCTGG GCGTGGTGCC TAAA2JIT6TC TTTCAGTAGC AAGCTGATTG 
CCTCCOGACC CGCACCACGG ATTTTTACAG AAAGTCATC6 TTCGACTAAC 

CCAGGGGCAG GCCCTTGGTG TAAGTGTTTA CAAA6CGGTT AAGCTG6GAT 
GGTCCCCGTC CGGGA?^CAC ATTCACAAAT GTTTCGCCAA TOCGACCCTA 

GGGTGCATAC GTGGGGA7AT GASATGCATC TTGGACTGTA TTTTTAG6TT 
CCCACGTATG CACCCCTATA CTCTACGTAG AACCTGACAT AAAAATCCAA 

GGCTATGTTC CCAGCCATAT CCCTCCGCG6 ATTCATGTTG TGCAGAACCA 
CCGATACAAG GGTCGGTATA GGGAGGCCCC TAAGTACAAC ACGTCTTGGT 

CCAGCACAGT GTATCCOGTG CACTTGGGAA ATTTGTCATG TAGCTTAGAA 
GGTCGT6TCA CATAGGCCAC GTGAACCCTT TAAACAGTAC ATCGAATCTT 

GGAA«ITGCGT G3AAGAACTT GGAGACGCCC TTGTGACCTC CAA6ATTTTC 
CCTTTACGCA CCTTCTTGAA CCTCTGCGGG AACACTGGAG GTTCTAAAAG 

CATGCATTCG TCCATAAT6A TGGCAATGGG" CCCACGGGCG GCGOCCTGGG 
GTACGTAAGC AGGTATTACT ACCGTTACCC GGGTGCCCGC CGCCGGACCC 

CGAAGATATT TCTGGGATCA CTAACGTCAX AGTTGTGTTC CAGGATQAGA 
GCTTCTATAA AGACCCTA6T GATTGCAGTA TCAACACAAG CTCCTACTCT 

TCGTCATAGG CCATTTTTAC AAAGCGCGGG CG6AGGGT6C CAGACTGCGG 
AOCAGTATCC GGTAAAAATG TTTCGCGCCC GCCTCCCACG GTCTGACGCC 

TATAATGGTT CCATCCGGCC CACGGGCGTA GTTACCCTCA CAGATTTQCA 
ATATTACCAA QGTA6GCCG6 GTCCCCGCAT CAATGGGAGT GTCTAAACGT 

TTTCCCAC6C TTTGAGTTCA GATGGGGG6A TCATGTCTAC CTGCGGGQCG 
AAAGGGTGCG AAACTCAAGT CTACCCCCCT AGTACAGATG GACGCCCCGC 

ATGAMS^AAA CGGTTTCCG6 GGTAGGGGAG ATCAGCTGGC AA6AAAGCAG 
TACTTCTTTT GCCAAAGGCC CCATCCCCTC TAGTCGACCC TTCTTTCGTC 

GTTCCTGA6C AGCTGC6ACT TACCGCAGCC GGTG6GCCCG TAAATCACAC 
CAAGGftCTCG TCGACGCTGA A7GGCGTCGG CCACCCGGGC ATITAGTGTG 

CTATTACCGG CTGCAACTGG TAGTTAAGAG AGCTGCAGCT GCCGTCATCC 
GATAATGGCC GACGTT6ACC ATCAATTCTC TC6ACGTCGA CGGCAGTAGG 

CTGAGCAGGG GGGCCACTTC GTTAAGCATG TCCCTGACTC GCATGTTTTC 
GACTCGTCCC CCCGGTGAAG CAATTCGTAC AGGGACTGAG CGTACAAAAG 
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3701 CCTGACCAAA ^^BSCCAGAA GGCGCTCGCC OCCCAGCGAT AGCAG'lJpT 
GGACTGGTTT AGGCGtSTCTT CCGCGAGCGG CGGGTCGCTA TCGTCAAGAA 

3751 GCAAGGAAGC AAAGTTTTTC AACGGTTTGA GACCGTCCGC C6TAGQCATG 
CGTTCCTTCG TTTCAAAAAG TTGCCAAACT CTGGCAGGCG GCATCCGTAC 

3B01 CTTTTGAGCG TTTGACCAAG CAGTTCCAGG CGGTCCCACA GCTCGGTCAC 
GAAAACTCGC AAACTGCTTC GTCAACCTCC CCCAGGGTGT CGAGCCACTG 

3851 CTGCTCTACG GCATCTCGAT CCAGCATATC TCCTCGTTTC GCGGGTTGG6 
GACGAGATGC CGTAGAGCTA GGTCGTATAG AGGAGCAAAQ CGCCCAACCC 

3901 GCGGCTTTCG CTGTACGGCA GTAGTCGGTG CTCGTCCAGA CGGGCCAGG6 
COCCGAAAGC GACATCCCGT CATCAGCCAC GAGCA6GTCT 6CCCGGTCCC 

3951 TCATGTCTTT CCACGGGCGC AGGGTCCTCC TCAGCCTAGT CTGGGTCACG 
AGTACAGAAA GGTGCCC6CG TCCCAGGAGC AOTCGCATCA GACCCAGTGC 

4001 GTGAAGGGGT GCGCTCCGG6 CTGCGCGCTG 6CCAGGGTGC GCTTGAGGCT 
CACTTCCCCA CGCGAGGCCC GACGCCCGAC CGGTCCCACG CGAACTCCGA 

4051 GGTCCTGCTG GTGCTGAAGC GCTGCCGGTC TTCGCCCTGC GCGTCGGCCA 
CCAGGACGAC CACGACTTCG CGACGGCCAG AAGCGGGACG CGCAGCCGGT 

4101 GGTAGCATTT GACCATGGTG TCATAGTCCA GCCCCTCCGC GGCGTGGCCC 
CCATCGTAAA CTGGTACCAC AGTATCAGCT CGGGGAGGCG CCGCACCGGG 

4151 TTGGCGCGCA GCTTGCCCTT GGAGGAGGCG CCGCACGAGG GGCAGTGCAG 
AACCGCGCGT CGAACGGGAA CCTCCTCCGC GGCGTGCTCC CCGTCACGTC 

4201 ACTTTTGAGG GCGTAGAGCT TGGGCGCGAG AAATACCGAT TCCGGGGAGT . 
TGAAWVCTCC CGCATCTCGA ACCCGC6CTC TTTATG6CTA ACGCCCCTCA 

4251 AGGCATCCGC GCCGCAGGCC CCGCAGACGG TCTCGCATTC CACGAGCCAG 
TCCGTAGGCG CG6CGTCCGG GGCGTCTGCC AGAGCGTAAG GTGCTCGGTC 

4301 GT6AGCTCTG GCCGTTCGGG GTCAAAAACC AGGTTTCCCC CATGCTTTTT 
CACTCGAGAC CGGCAAGCCC CAGTTTTTOG TCCAAAGGGG GTACGAAAAA 

4351 GATGCGTTTC TTACCTCTGG TTTCCATGAG CCGGTGTCCA CGCTCCGTGA 
CTACGCAAAG AATGGAGACC AAAGGTACIC GGCCACAGGT GCGAGCCACT 

4401 CGAAAAGGCT GTCCGTGTCC CCGTATACA6 ACTTGAGAGG CCTGTCCTCG 
GCTTTTCCGA CAGGCACAGG GGCATATGTC TGAACTCTCC GGACAGGAGC 

4451 AGC6GTGTTC CGCGGTCCTC CTCGTATAGA AACTCGGACC ACTCTGAGAC 
TCGCCACAAG 6CGCCAGGAG GAGCA7ATCT TTGAGCCTGG TGAGACTCTG 

4501 AAAGGCTCGC GTCCAGGCCA GCAC6AAGGA GGCTAAGTGG GAGGGGTAGC 
TTTC06AGCG CAGGTCCGGT CGTGCTTCCT CCGATTCACC CTCCCCATCG 

4551 GGTCGTTGTC CACTAGGGOG TCCACTCGCT CCAGGGTCTG AAGACACATG 
* CCAGCAACAG 6TGATCCCCC AGGTGAGCX^A GGTCCCACAC TTCT6TGTAC 

4601 TCGCCCTCTT CGCCATCAAG GAAGGTGATT GGTTTGTAGG TGTAGGCCAC 
AOCGGGAGAA 6CCGTAGTTC CTTCCACTAA CCAAACATCC ACATCCGGTG 
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4551 6TGACCGGGT ^ptCTGAAG GGGGGCTATA AA;i6666G'rG GGGGCd^TT 
CACTGGCCCA CAAGGACTTC CCCCCGATAT TTTCCCCCAC CCCCGCGCAA 

4701 CGTCCTCACT CTCTTCCGCA TCGCTGTCTG CGAGGGCCAG CTGTTGGGGT 
GCAGGAGTGA GAGAAGGCGT AGCGACAGAC GCTCCCGGTC GACAACCCCA 

4751 GA6TACTCCC TCTGAAAAGC GGGCATGACT TCTGCGCTAA GATTGTCAGT 
CTCATGAGGG AGACTTTTCG CCCGTACTGA AGACGCGATT CTAACAGTCA 

4801 TTCCAAAAAC GAGGAGGATT TGATATTCAC CTGGCCCGCG GTCATGCCTT 
AAGGTTTTTG CTCCTCCTAA ACTATAAGTG GACCGGGCGC CACTACGGAA 

4851 TGAGGGTGGC CGCATCCATC TGGTCAGAAA AGACAATCTT TTTGTT6TCA 
ACTCCCACCG GCGTAGGTAG ACCAGTCTTT TCTGTTAGAA AAACAACAGT 

4901 AGCTTGGTGG CAAACGACCC GTAGAGGGCG TTGGACAGCA ACTTCGCGAT 
TCGAACCACC GTTTGCTGGG CATCTCCCGC AACCTGTCGT TGAACCGCTA 

4951 GGAGCGCAGG GTTTGGTTTT TGTCGCGATC GGCGCGCTCC TTGGCCGCGA 
CCTCGCGTCC CAAACCAAAA ACAGCGCTAG CCGCGCGAGG AACCGGCGCT 

5O01 . TGTTTAGCTG CACGTATTCG CGCGCAACGC ACCGCCATTC GGGAAAGACG 
ACAAATCGAC GTGCATAAGC GCGCGTTGCG TGGCGGTAAG CCCTTTCTGC 

5051 GTGGTGCGCT CGTCGGGCAC CAGGTGCACG CGCCIAACCGC GGTTGTGCAG 
CACCACGCGA GCAGCCCGTG GTCCACGTGC GCGGTTGGCG CCAACACGTC 

5101 GGTGACAAGG TCAACGCTGG TGGCTACCTC TCCGCGTAGG CGCTCGTTGG 
CCACTGTTCC AGTTGCGACC ACCGATGGAG AGGCGCATCC GCGAGCAACC 

5151 TCCAGCAGAG_GCGGCCGCCC TTGCG CGAG C AGAATGGCGG TAGGGGGTCT 
AGGTCGTCTC CGCCGGCGGCS" AAC(K:GCTCG" TCTTACCGCC ' ATCCCCCAGA 

5201 AGCTGCGTCT CGTCCGGGGG GTCTGCGTCC ACGGTAAAGA CCCCGGGCAG 
TCGACGCAGA GCAGGCCCCC CAGACGCAGG TGCCATTTCT GGGGCCCGTC 

5251 CAGGCGCGCG TCGAAGTAGT CTATCTTGCA TCCTTGCAAG TCTAGCGCCT 
GTCCGCGCGC AGCTTCATCA 6ATAGAACGT AGGAACGTTC AGATCGCGGA 

5301 GCTGCCATGC GCSGGCGGCA AGCGC6CGCT CGTATGGGTT GAGTGGGGGA 
CGACGGTACG CGCCCGCCGT TCGCGCGCGA GCATACCCAA CTCACCCCCT 

5351 CCCCATGGCA TGGGGTGGGT GAGCGCGG7«3 6CGTACATGC CGCAAATGTC 
GGGGTACCGT ACCCCACCCA CTCGCGCCTC CGCATGTACG GCGTTTACAG 

5401 GTAAACGTAG A65GGCTCTC TGAGTATTCC AAGATATGTA GGGTAGCATC 
CATTTGCATC .TCCCCGAGAG ACTCATAAGG TTCTATACAT CCCATC6TAG 

5451 TTCCACCGCG GATGCTG6CG CGCAC6TAAT CGTATAGTTC GTGCGAGGGA 
AAGGTGGCGC CTACGACCGC GCGTGCATTA GCATATCAAG CACGCTCCCT 

5501 GCGAGGAGGT CGGGACCGA6 GTTGCTACGG GCGGGCTGCT CT6CTCGGAA 
CGCTCCTCCA GCCCTGGCTC CAACGATOCC CGCCCGACGA GACGAGCCTT 

5551 GACTATCTGC CTGAAGATGG CATGTGAGTT GGAT6ATATG GTTGGACGCT 
CTGATAGACG GACTTCTACC 6TACACTCAA CCTACTATAC CAACCTGCGA 
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5601 



GGAAGACGTT 
CCTTCTGCAA 



G^kcTGGCG TCTGTGAGAC CTACCGCGTC 
CTOfcGACCGC AGACACTCTG GATC3CCGCAG 



5651 GAGGCGTAGG A5TCGCGCAG CTTGTTGACC AGCTCGGCGG TGACCTGCAC 
CTCCGCATCC TCAGCGCCTC GAACAACTGG TCGAGCCGCC ACTQGACGTG 

5701 GTCTAGGGCG CACTAGTCCA GCGTTTCCTT GATGATGTCA TACTTATCCT 
CAGATCCCGC GTCATCAGGT CCCAAAGGAA CTACTACAGT ATGAATAQGA 

5751 GTCCCTTTTT TTTCCACAGC rCGCGGTTGA GGACAAACTC TTCGCGGTCT 
CA6GGAAAAA AAAGGTGTCG AGCGCCAACT CCTGTTTGAG AAGCGCCAGA 

5801 TTCCAGTACT CTTGGATCGG AAACCCGTCG GCCTCCGAAC GGTAAGAGCC 
AAGGTCATGA GAACCTAGCC TTTGGGCAGC CGGAGGCTTG CCATTCTCGG 

5B51 TAGCATGTAG AACTGGTTGA CGGCCTGGTA 6GCGCAGCAT CCCTTTTCTA 
ATCGTACATC TTGACCAACT CCXTGGACCAT CCGCGTCGTA GGGAAAAGAT 

5901 CGGGTAGCGC GTATGCCTGC GCGGCCTTCC GGAGCGAGGT GTGGGTGAGC 
GCCCATCGCG CATACGGACG CGCCGGAAGG CCTCGCTCCA CACCCACTCG 

5951 GCAAAGGTGT CCCTGACCAT GACTTTGAGG TACTGGTATT TGAAGTCAGT 
C3TTTCCACA GGGACTGGTA CTGAAACTCC ATGACCATAA ACTTCAGTCA 

6001 GTCGTCGCAT CCGCCCTGCT CCCAGAGCAA AAAGTCCGTG CGCTTTTTGG 
CAGCAGCGTA GGCGGGACGA GGGTCTCGTT TTTCAGGCAC GCGAAAAACC 

6051 AACGCGGATT TGGCAGGGCG AAGGTGACAT CGTTGAAGAG TATCTTTCCC 
TTGCGCCTAA ACCGTCCCGC TTCCACTGTA GCAACTTCTC ATAGAAAGGG 

6101 GCGCGAGGCA TAAAGTTGCG TGTGATGCGG AAGGGTCCCG GCACCTCGGA 
CGCGCTCCGT ATTTCAACGC ACACTACGCC TTCCCAGGGC CGTGGAGCCT 

6151 ACGGTTGTTA ATTACCTGGG CGGCGAGCAC GATCTCGTCA AAGCCGTTGA 
TGCCAACAAT TAATGGACCC GCCGCTCGTG CTAGAGCAGT TTCGGCAACT 

6201 TGTTGTGGCC CACAATGTAA AGTTCCAAGA AGCGCGGGAT GCCCTTGATG 
ACAACACCGG GTGTTACATT TCAAGGTTCT TCGCGCCCTA CGGGAACTAC 

6251 GAAGGCAATT TTTTAAGTTC CTCGTAGGTG AGCTCTTCAG GGGAGCTGAG 
CTTCCGTTAA AAAATTCAAG GAGCATCCAC TCGAGAAGTC CCCTCGACTC 

6301 CCCGTGCTCT GAAAGGGCCC AGTCTGCAAG ATGAGGGTTG GAAGCGACGA 
GGGCACGAGA CTTTCCCGGG TCAGACGTTC TACTCCCAAC CTTCGCTGCT 

6351 ATGAGCTCCA CA6GTCACGG GCCATTAGCA TTTGCAGGTG GTCGCGAAAG 
TACTCGAGGT GTCCAQTGCC CGGTAATCGT AAACGTCCAC CAGCGCTTTC 

6401 GTCCTAAACT GGCGACCTAT GGCCATTTTT TCTCGGGTGA TQCAGTAGAA 
CACGATTTGA CCGCTGGATA CCGGTAAAAA AGACCCCACT ACGTCATCTT 

6451 GGTAAGCG6G TCTTGTTCCC AGCGGTCCCA TCCAAGGTTC GCGGCTAGGT 
CCATTCGCCC AGAAPkAGGG TCGCCAGGGT AGGTTCCAAG CGCCGATCCA 

6501 CTCGCGCGGC A6TCACTAGA GQCTCATCTC CGCCGAACTT CATGACCACC 
CAGCGCGCCG TCAGTGATCT CCGAGTAGAG GCGGCTTGAA GTACTGGTCG 
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6551 ATGAAGOGCA.C^kcOCCTT CCCAAAGGCC CCCATCCAAG TXTfiG(0mc 
TACTTCCCGT G?!T&GACGAA GGGTTTCCGG GGGTAGGTTC ATATCCJBSJ^G 

66 Dl TACATCGTAG GTGACAAAGA GACGCTCGGT GCGAGGATGC GAGCCGATCG 
ATGTAGCATC CACTGTTTCT CTGCGAGCCA CGCTCCTACG CTCGGCTAGC 

6651 GGAAGAACTG GATCTCCCGC CACCAATTGG AGGAGTGGCT ATTGATGTGG 
CCTTCTTGAC CTAGAGGGCG GTGGTTAACC TCCTCACCGA TAACTACACC 

6701 TGAAAGTAGA AGTCCCTGCG ACGGGCCGAA CACTCGTGCT GGCTTTTGTA 

ACTTTCATCT TCAGGGACGC TGCCCGGCTT GTGAGCACGA CCGAAAACAT - 

6751 AAAACGTGCG CAGTACTGGC AGCGGTCCAC GGGCTGTACA TCCTGCACGA 
TTTTGCACGC GTCATGACCG TCGCCACGTG CCCGACATGT AGGACGTGCT 

6801 GGTTGACCTG ACGACCGCGC ACAAGGAAGC AGAGTGGGAA TTTGAGCCCC 
CCAACTGGAC TGCTGGCGCX3 TGTTCCTTCG TCTCACCCTT AAACTCGGGG 

6B51 TCGCCTGGCG QGTTTGGCTG GTGGTCTTCT ACTTCGGCTG CTTGTCCTTG 
A6CGGACCGC CCAAACCGAC CACCAGAAGA TGAAGCCGAC 6AACAGGAAC 

6901 ACCGTCTGGC TGCTCGAGGG GAGTTACGGT GGATCGGACC ACCACGCCGC 
TGGCAGACCG ACGAGCTCCC CTCAATGCCA CCTAGCCTCG TGGTGCGGCG 

6951 GCGAGCCCAA AGTCCAGATG TCCGCGCGCG GCGGTCGGAG CTTGATGACA 
CGCTCGGGTT TCAGGTCTAC AGGCGCGCGC CGCCAGCCTC GAACTACTCT 

7001 ACATCGCGCA GATGGGAGCT GTCCATGGTC TGGAGCTCCC GCGGCGTCAG 
TGTAGCGCGT CTACCCTCGA CAGGTACCAG ACCTCGAGGG CGCCGCAGTC 

7051 GTCAGGCGGG AGCTCCTGCA GGTTTACCTC GCATAGACGG GTCAGGGCGC 
CAGTCCGCCC TCGAGGACCT CCAAATGGAG CGTAtCTGCC CAGTCCCGCG 

7101 GGGCTAGATC CAGGTGATAC CTAATTTCCA GGGGCTGGTT GGTGGCGGCG 
CCCGATCTAG GTCCACTATG GATTAAAGGT CCCCGACCAA CCACCGCCGC 

7151 TCGATGGCTT GCAAGAGGCC GCATCCCCGC GGCGCGACTA CGGTACCGCG 
AGCTACCGAA CGTTCTCCGG CGTAGGGGCG CCGCGCTGAT GCCATGGCGC 

7201 CGGCGGGCGG TGGGCCGCGG GGGTGTCCTT GGATGATGCA TCTAAAAGCG 
GCCGCCCGCC ACCCGGCGCC CCCACAGGAA CCTACTACGT AGATTTTCGC 

7251 GTGACGCGGG CGAGCCCCCG GAGGTAGGGG GGGCTCCGGA CCCGCCGGGA 
CACTGCGCCC GCTCGGGGGC CTCCATCCCC CCCGAGGCCT GGGCGGCCCT 

7301 GAGGGGGCAG GGGCACGTCG GCGCCGCGCG CGGGCAGGAG CTGGTGCTGC 
CTCCCCCGTC CCCGTGCAGC CGCGGCGCGC GCCCGTCCTC GACCACGACG 

7351 GCGCGTAGGT TGCTGGCGAA CGC6AC6ACG CGGCGGTTGA TCTCCTGAAT 
CGC6CATCCA ACGACGGCTT GCGCTGCTGC GCCGCCAACT AGAGGACTTA 

7401 CTGGCGCCTC TGCGTGAAGA CGACGGGCCC GGTGAGCTTG AACCTGAAAG 
GACCGCGGAG ACGCACTTCT GCTGCXICGGG CCACTCGAAC TTGGACTTTC 

7451 AGAGTTCGAC AGAATCAATT TCGGTGTCGT TGACGGCGGC CTGGCGCAAA 
TCTCAAGCTC TCTTAGTTAA AGCCACAGCA ACTGCCGCCG GACCGCGTTT 
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7501 ATCTCCTGCA C^iCTnCTGA GTTGTCTTGA TAGGCG;?!^ GGGC&^U^ 
TAGAGGACGT dSRAGGACT CAACA»ACT ATCCGCTA6A GCCOGTHmtT 

7551 CTGCTCGATC TCTTCCTCCT 66AGATCTCC GCGTCCGGCT CGCTCCACGG 
GACGAGCTAG ASA21GGAGGA CCTCTAGAG6 CGCA6GCCGA GC6AGGTGCC 

7601 TGGCGGCGA6 6TCGTT6GAA ATGCGGGCCA TGA6CTGCGA GAA66CGTTG 
ACCGCCGCTC CAGCAACCTT TACGCCCGGr ACTCGACGCT CTTCCGCAAC 

7651 AGGCCTCCCT CGTTCCAGAC GCGGCTGTAG ACCACGCCCC CTTCGGCATC 
TCC6GA6GGA GCAAGGTCTG CGCCGACATC 7GGTGCGGGG GAAGCC6TAG 

7701 GCGG6C6CGC ATGACCACCT GCGCGAGATT GAGCTCCAC6 TGCCGGGCGA 
CGCCCGCGCG TACTGGTGGA CGCGCTCTAA CTCGAGGTGC ACGGCCCGCT 

7751 AGACGGCGTA 6TTTCGCAGG CGCTGAAAGA GGTAGTTGAG GGTGGT6GC6 
TCT6CCGCAT CAAA6CGTCC QC6AC37TCT CCATCAACTC CCACCACCGC 

7801 GTGTGTTCTG CCACGAAGJUL GTACATAACC CAGCGTCGCA ACGTGGATTC 
CACACAAGAC GGTGCTTCTT CATGTATTGG GTC6CAGCGT TGCACCTAAG 

7851 GTTGATATCC CCCAAGGCCT CAAGGCGCTC CATGGCCTCG TAGAAGTCCA 
CAACTATAGG GGGTTCCG6A 6TTCCGCGAG GTACCGGAGC ATCTTCA6GT 

7901 CGGCGAAGTT GAAAAACTGG GAGTTGCGCG CCGACACGGT TAACTCCTCC 
GCCGCTTCAA CTTTTTGACC CTCAACGCGC GGCTGTGCCA ATT6AGGA3G 

7951 TCCAGAAGAC GGATGAGCTC GGCGACAGTG TCGCGCACCT CGCGCTCAAA 
AGGTCTTCTG CCTACTCGAG CCGCTGTCAC AGCGCGTGGA GCGCGAGTTT 

8001 6GCTACAGGG GCCTCTTCTT CTTCTTCAAT CTCCTCTTCC ATAAGGGCCT 
CCGATGTCCC CXK2AGAAGAA GAAGAAG7TA GAG6AGAAGG TATTCCCGGA 

8051 CCCCTTCTTC TTCTTCTGGC GGCGGTGGGG GAGGGGGGAC ACGGCGGC6A 
6GGGAAGAAG AAGAA6ACCG CCGCCACCCC CTCCCCCCTG TGCCGCCGCT 

8101 CGACGGCGCA CCGGGAGGCG GTCGACAAAG CGCTCGATCA TCTCCCCGCG 
GCTGCCGCGT GGCCCTCCGC CAGCTGTTTC GCGAGCTAGT AGAGGGGCGC 

8151 GCGACGGCGC ATGGTCTCGG TGACGGCGCG GCCGTTCTCG C6GGGGCGCA 
CGCTGCCGCG TACCAGA6CC ACTGCCGCGC CGGCAAGAGC OCCCCCGCGT 

8201 GTTGGAAGAC GCCGCCCGTC ATGTCCCGGT TATGGGTTOG CGGCGGGCTC 
CAACCTTCTG CGGCGGGCAG TACAGGGCCA ATACCCAACC 6CCCCCCGAC 

8251 CCATGC6GCA GGGATACGGC GCTAACGATG CATCTCAACA ATTGTTGTGT 
GGTACGCCGT CCCTATGCCG CGATTGCTAC 6TAGAGTTGT TAACAACACA 

8301 AGGTACTCCG CCGCCGAGG6 ACCT6AGCGA GTCCGCATC6 ACC6GATCGG 
TCCATGAGGC GGCGGCTCCC TOGACTCGCT CAGGCGTAGC TGGCCTAGCC 

6351 AAAACCTCTC GAGAAAGGCG TCTAACCAGT CACAGTC6CA AGGTAGGCTG 
. • TTTTGGAGAG CTCTTTCCGC AGATTGGTCA GTCTCAGCGT TCCATCCGAC 

8401 AGCACCGT6G CGGQCGGCAG CGGGCGGCGG TCGGGGTTGT TTCTGGCGGA 
1CGTGGCACC GCCCGCCGTC GCCCGCCGCX: AGfCCCCAACA AAGACCGCCT 
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8451 GffTGCTGCTG J^ll^rGTMT TAAAGTAGGC G61CTTGAGA CGGCGCttmS 
CCACGACGAC ifflKTACATTA ATTTCATCCG CCAGAACTCT GCCGCCTSCC 

8501 TCGACAGAAG CACCATGTCC TTGGGTCCGG CCTGCTGAAT GC6CAGGCGG 
AGCTGTCTTC GTGGTACAG6 AACCCAGGCC G6ACGACTTA CGCGTCCGCC 



8551 TCGGCCATGC CCCAGGCTTC GTTTTGACAT CGGCGCAGGT CTTTGTAGTA 
AOCCGGTACG OGGTCCGAftG CAAAACTGTA GCCGCGTCCA GA^^ACATCAT 

m 

8601 GTCTTGCAT6 AGCCTTTCTA CCGGCACTTC TTCTTCTCCT TCCTCTT6TC 
CAGAACGTAC TCOGAAAGAT GGCCG7GAA6 AAGAAGAGGA AGGAGAACAG 

8651 CTGCATCTCT TGCATCTATC GCTGCGGC6G CGGCGGAGTT TG6CCGTAGG 
GACGTAGAGA ACGTAGATAG CGACGCC6CC GCCGCCTCAA ACCG6CATCC 

8701 TGGCGCCCTC ncCTCCCAT GCGTGTGACC CC6AAGCCCC TCATCGGCTG 
ACCGCGG6AG AAGGAGGGTA CGCACACTGG- GGCTTCGGGG AGTAGCCGAC 

8751 AAGCAGGGCT AGGTCOGCGA CAACGCGCTC G6CTAATAT6 GCCTGCTGCA 
TTCGTCCCGA TCCAGCCGCT GTTGCGCGAG CCGATTATAC CG6ACGACGT 

8B01 CCTGCGTGAG GGTAGACTGG AAGTCATCCA T6TCCACAAA GCGGTGGTAT 
6GACGCACTC CXrATCTGACC TTCAGTA6GT ACAGGTGTTT CGCCACCATA 



8851 GCGCCCGTGT TGATGGTGTA AGTGCAGTTG GCCATAACGG ACCA6TTAAC 
CGCGGGCACA ACTACCACAT TCACGTCAAC CGGTATTGCC TGGTCAATTG 

8901 GGTCTGGTGA CCCGGCTGCG AGAGCTCGGT GTACCTGAGA CGCGAGTAAG 
CCAGACCACT GGGCCGACGC TCTCGA6CCA CATGGACTCT GCGCTCATTC 

8951 CCCTCGAGTC AAATACGTAG TCGTTGCAAG TCCGCACCAG GTACTGGTAT 
GGGAGCTCAG TTTATGCATC AGCAACGTTC AGGCGTGGTC rATGACCATA 

9001 CCCACCAAAA AGTGCGGCGG CGGCTGGCGG TAGAGGGGCC AGCGTAGGGT 
GGGTGGTTTT TCACGCCGCC GCCGACCGCC ATCTCCCCGG TCGCATCCCA 

9051 G6CCGGGGCT CCGGGGGCGA GATCT7CCAA CATAAGGCGA TGATATCCGT 
CCGGCCCCGA GGCCCCCGCT CTAGAAGGTT GTATTCCGCT ACTATAGGCA 

9101 AGATGTACCT GGACATCCAG GTGATGCCGG CGGCGGTGGT GG^^GGCGCGC 
TCTACATGGA CCTGTAGGTC CACTACGGCC GCCGCCACCA CCTCCGCGCG 



9151 GGAAAGTCGC GGACGCGGTT CCAGATGTTG C6CAGCGGCA AAAAGTGCTC 
CCTTTCAGCG CCTGCGCCAA GGTCTACAAC 6CGTCGCCGT TTTTCACGAG 

9201 CATGGTCGGG AC6CTCTGGC CGGTCAGGCG CGCGCAATCG TT6ACGCTCT 
GTACCAGCCC TGCGAGACCG GCCAGTCCGC GCGCGTTAGC AACTCCGAGA 

9251 AGACCGTGCA AAAGGAGAGC CTGIAAGCGG GCACTCTTCC GTGGTCTGGT 
TCTGGCACGT TTTCCTCTCG GACATTCGCC CGTGAGAAGG CACXrAGACCA 

9301 GGATAAATTC GCAAGG6TAT CATGGCGGAC GACCGGGGTT CGAGCCCCG7 
CCTATTTAAG CGTTCCCATA GTACCGCCTG CTGGCCCCAA GCTCGGGGCA 

9351 ATCCGGCC6T CCGCCGTGAT CCATGCGGTT ACCGOCCGCG TGTCGAACCC 
TAGGCCGGCA G6CGGCACTA G6TACGCCAA TGGCGGGCGC ACAGCTTGGG 
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BiOl AGGTGTGCGA otfkuSACAA CGGCGGACTG CTCCTTTTSG LiJLLl««A 
TCCACACGCT gS55tCTGTT GCCCCCTCAC GA6GAAAACC GAAGGASQ&T 

9451 GGCGCGGCGG CTGCTGCGCT AGCTTTTTTG GCCACTGGCC GCGCGCAGCG 
CCGCGCCGCC 6ACGACGCGA TCGAAAAAAC CGGTGACCQG CGCGCGTCGC 

9501 TAAGCGGTTA GGCTGGAAAG CGAAAGCATT ATIGTGGCTCG CTCCCTGTAG 
ATTCGCCAAT CCGACCTTTC GCTTTCGIAA TTCACC6AGC GAGGGAC^^TC 

9551 CCGGAGGGTT ATTTTCCAAG GGTTGAGTCG CGGGhCCCCC GGTTCGAGTC 
GGCCTCCCAA TAAAAGGTTC CCAACTCAGC GCCCTGGGGG CXJ^CTCAG 



9601 TCGGACCGGC CGGACTGCGG CGAAC6SGGG TTTGCCTCCC CGTCATGCAA 
AGCCTGGCCG 6CCTGACGCC GCTTGCCCCC AAACGGAGGG GCAGTACGTT 

9651 GACCCCGCTT GCAAATTCCT CCGGAAACAG GGACGAGCCC CTTTTTTGCT 
CTGGGGCGAA CCTTTAAGGA GGCCTTTGTC CCTGCTCGCG GAAAAAACGA 

9701 TTTCCCAGAT GCATCCGGTG CTGCGGCAGA TGCGCCCCCC TCCTCAGCAG 
AAAGGGTCTA CGTAGGCCAC GACGCCSTCT ACGCGGGGGG AGGAGTCGTC 

9751 CGGCAAGAGC A^GAGCAGCG GCAGACATGC AG6GCACCCT CCCCTCCTCC 
GCCGTTCTCG TTCTCGTCGC CGTCTGTACG TCCCGTGGGA GGGGAGGAGG 

98D1 TACCGCGTCA GSAGGGGCGA CATCCGCGGT TGACGCGGCA GCAGATGGTG 
ATGGCGCAGT CCTCCCCGCT GTAGGC3CCA ACTGCGCCGT CGTCTACCAC 

9851 ATTACGAACC CCXTGCGGCGC CGGGCCCGGC ACTACCTGGA CTTGGAGGAG 
TAATGCTTGG GGGCGCCGCG GCCCGGGCCG TGATGGACCT GAACCTCCTC 

9901 GGCGAGGGCC TGGCGCGGCT AGGAGCGCCC TCTCCTGAGC GGCACCCAAG 
CCGCTCCCGG ATCGCGCCGA TCCTCGCGGG AGAGGACTCG CCGTGGGTTC 

9951 GGTGCAGCTG AAGCGTGATA CGCGTGftGGC GTACGTGCCG CGGCAGAACC 
CCACGTCGAC TPCGCACTAT GCGCACTCCG CATGCACGGC GCCGTCTTGG 

10001 TGTTTCGCGA CCGC6AGGGA GAGGAGCCCG AGGAGATGCG GGATCGAAAG 
ACAAAGCGCT GGCGCTCCCT CTCCTCGGGC TCCTCTACGC CCTAGCTTTC 

10051 TTCCACGCAG GGCGCGAGCT GCGGCATGGC CTGAATCGCG AGCGGTTGCT 
AAGGTGCGTC CCGCGCTCGA CGCCGTACCG 6ACTTAGCGC TCGCCAACGA 

101 01 GCGCGAGGAG GACTTTGAGC CCGACGCGCC? AACCGGGATT AGTCCCGCGC 
CGCGCTCCTC CTGAAACTCG GGCTGCGCGC TTGGCCCTAA TCAGGGCGCG 

10151 GCGCACACGT GGCGGCCGCC GACCTGGTAA CCGCATACGA GCAGACGGTG 
CGCGTGTGCA CCGCCGGCGG CTGGACXATT GGCX3TATGCT CGTCTGCCAC 

10201 AACCAGGAGA TTAACTTTCA AAAAAGCTTT AACAACXIACG TGCGTACGCT 
TTGGTCCTCT AATTGAAAGT TTTTTCGAAA TTGTTGGTGC ACGCATGCGA 

10251 TGTGGCGCGC GAGGAGGTGG CTATAGGACT GATGCATCTG TGGGACTTTG 
ACACCGCGCG CTCCTCCACC GATATCCTGA CTACGTAGAC ACCCTGAAAC 

10301 taagcgcgct ggagcaaaac ccaaatagca agcxtgctcat ggcgcagctg 

ATTCGCCCCA CCTCCTTTTG GGTTTATCGT TCGGCGAGTA CCGCGTCGAC 
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10351 TTCCTTATA6 T^ftpCACAG CAGGGACAAC GAGGCATTCA GGGATGl 

AAGGAATATC AWTCGTGTC GTCCCTGTTG CTCCGTAAGT CCXTACGRA 

10401 GCTAAACATA GTAGA6CCCG AGGGCCGCT6 6CTGCTCGAT TTGATAAACA 
CGATTTGTAT CATCTCGGGC TCCCGGCGMT CGAC6AGCTA AACTATTTGT 

10451 TCCTGCAGAG CATAGTG6TG CAGGAGCGCA GCTTGA6CCT 6GCTGACAAG 
AGGACGTCTC GTATCACCAC 6TCCTC6CGT CGAACTCGGA CCGACTGTTC 

10501 GT0GCC6CCA TCAACTATTC CATGCTTAGC CTGGGCAAGT TTTACGCCCG 
CACCGGCGGT AGrTGATAVkG GTACGAATCG GACCCGTTCA AAATGCGGGC 

10551 CAAGATATAC CATACCCCTT ACGTTCCCAT AGACAAGGAG GTAAAGATCG 
GTTCTATATG GTATGGGGAA TGCAAGGGTA TCTCTTCCTC CATTTCTAGC 

10601 AGG6GTTCTA CATGCGCATG GCGCTGA^ TGCTTACCTT GAGCGACGAC 
ICCCCAAGAT GTACGCGTAC CGCGACTTCC ACGAATGGAA CTCGCTGCTG 

10651 CT6GGCGTTT ATCGCAACGA GCGCATCCAC AAGGCCGTGA GCGTGAOCCG 
GACCC6CAAA TAQCGTT6CT CGCCTAGGTG TTCCGGCACT CGCACTCGGC 

10701 GCG6CGCGAG CTCAGCGACC 6CGA6CTGAT GCACAGCCTG CAAAGGGCCC 
CGCCGCGCTC GAGTCGCT6G CGCTC6ACTA CGTGTCGGAC 3TTTCCCGGG 

10*751 TGGCTGGCAC GGGCA6CGGC GATAGAGAGG CCGAGTCCTA CTTTGACGCG 
ACCGACCGTG CCCGTCGCCG CTATCTCTCC GGCTCAGGAT GAAACTGCGC 

10801 GGCGCTGACC TGCGCTGGGC CCCAAGCCGA CGCGCCCTGG AGGCAGCTGG 
CCGCGACTGG ACGCGACCCG GGGTTCGQCT GCGCG6GACC TCCGTCGACC 

10851 GGCCGGACCT GGGCTGGCGG TGGCACCCGC GCGCGCTGGC AACGTCGGCG 
CCGGCCTGGA CCCGACCGCC ACCGTGSGCG CGCGC6ACCG TTGCAGCCGC 

10901 GCGTG6AGGA ATATGACGAG iSACGATSAGT ACGAGCCAGA GGACGGCGAG 
CGCACCTCCT TATACTGCTC CTGCTACTCA TGCTCGGTCT CCTGCCGCTC 

10951 TACTAAGCGG TGATGTTTCT GATCAGATGA TGCAAGACGC AACGGACCCG 
ATGATTCGCC ACTACAAA6A CTAGTCTACT ACGTTCTGCG TTGCCTGGGC 

11001 GCGGTGCGGG CGGCGCTGCA GAGCCAGCCG TCCGGCCTTA ACTCCACGGA 
COCCACGCCC GCCGCGACCT CTCCGTCGGC AGGCCGGAAT TGAGGTGCCT 

11051 CGACTGGCGC CAGGTCATGG ACCGCATCAT GTCGCTGACT GCGCGCAATC 
GCT6ACCGCG GTCCAGTACC TGGCGTAG7A CAGCGftCTGA CGCGCGTTAG 

11101 CTGACGCGTT CCGGCAGCAG CCGCAGGCCA ACCGGCTCTC CGCAATTCTG 
6ACTGCGCAA GSCCGTCGTC GGCGTOCGGT 7G0CCGAGAG GCGTTAAGAC 

11151 GAAGCGGTGG TCCCGGOGCG CGCAAACCCC ACGCACGAGA A6GTGCTGGC 
CTTCGCCACX: J^GGGCCGCGC GCXSTTTGGGG T60GTGCTCT TCCACGACCG 

11201 GATCGTAAAC GCGCTOGCC6 AAAACAGGGC CATCCGGCCC GACGAGGCCG 
CTAOCATTTG CQCGACCGGC TTTTGTCCCG GTAGGCCGGC CTGCTCCGGC 

11251 GCCTGGTCTA CX3ACGC0CTG CTTCAGCGCG TGGCTCGTTA CAACAGCGGC 
CGGACCAGAT QCTGCGCGAC GAAGTCGCGC ACCGAGCAAT GTTGTCGCCG 
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11301 AACGTGCAGA C^PCTCGA CCGGCTGGTG GGGGATCnfeC' GCSAGC^^T 
TTGCACGTCT gSKtGGACCT GGCCGACCAC CCCCTACACG CGCTCCSB&A 

11-351 6GCGCAGCGT GAGCGCGCGC AGCAGCAGGG CAACCTGGGC TCCATGGTTG 
CCGCGTCGCA CTCGCGCGCG TCGTC6TCCC GTTGGACCCC AGGTACCAAC 

11401 CACTAAACGC CITCCTGAGT ACACAGCCCG CCAACGTGCC GCGGGGACAG 
GTGATTTGCG GAAGGACTCA TGTGTCGGGC GGTTGCACGG CGCCCCTGTC 

11451 GAGGACTACA CCAACTTTGT GAGCGCACTG CGGCTAATGG TGAC!rGAGAC 
CTCCTGATGT GGT7-GAAACA CTCGCGTGAC GCCGATTACC ACTGACTCTG 

11501 ACCGCAAAGT SAGGTGTACC AGTCTG3GCC AGACTATTTT TTCCAGACCA 
TGGCGTTTCA CTCCACATGG TCAGACCCGG TCTGATAAAA AAGGTCTGGT 

11551 GTAGACAAGG CCT6CAGACC GTAAACCTGA GCCAGGCTTT CAAAAACTTG 
CATCTGTTCC GGACGTCTGG CATTTGGACT CGGTCCGAAA GTTTTTGAAC 

11601 CAGGGGCTGT GGOGGGTGCG GGCTCCCACA GGCGACCGCG CGACCGTGTC 
GTCCCCGACA CCCCCCACGC CCGAGQGTGT CCGCTGGCGC GCTGGCACAG 

11651 TA6CTTGCTG ACGCCCAACT CGCGCCTGTT GCTGCTGCTA ATAGCGCCCT 
ATC6AACGAC TCCGGGTTGA GCGCGGACAA CGACGACGAT TATCGCGGGA 

11701 TCACGGACAG TGGCAGCGTG TCCCGGGACA CATACCTAGG TCACTTGCTG 
AGTGCCTGTC ACC6TCGCAC AGGGCCCTGT GTATGGATCC AGTGAACGAC 

11751 ACACTGTACC GCGAGGCCAT AGGTCAGGCG CATGTGGACG AGCATACTTT 
TGTGACATGG CGCTCCGGTA TCCAGTCCGC GTACACCTGC TCGTATGAAA 

11801 CCAGGAGATT ACAAGTGTCA GCCGCGCGCT GGGGCAGGAG GACACGGGCA 
GCTCCTCTAA TGTTCACAGT CGGCGCGCGA CCCCGTCCTC CTGTGCCCGT 

11 851 GCCTGGAGGC AACCCTAAAC TACCTGCTGA CCAACCGGCG GCAGAAGATC 
CGGACCTCCG TTGGGATTTG ATGGACGACT 6GTTGGCCGC CGTCTTCTAG 

11901 CCCTCGTTGC ACAGTTTAAA CAGCGAGGAG GAGCGCATTT TGCGCTACGT 
GGGAGCAACG TGTCAAATTT GTCGCTCCTC CTCGCGTAAA ACGCGATGCA 

11951 GCAGCAGAGC GTGACCCTTA ACCTGATGCG CGACGGGGTA ACGCCCAGCG 
CGTCGTCTCG CACTCGGAAT TGGACTACGC GCTGCCCCAT TQCGGGTCGC 

12001 TGGCGCTGGA CATGACCGCG CGCAACATSG AACCGGGCAT GTATGCCTCA 
ACCGCGACCT GTACTGGCGC GCGTTGTACC TTGGCCCGTA CATACGGAGT 

12051 AACCGGCCGT TTATCAACCG CCTAATG6AC TACTTGCATC GCGCGGCCGC 
TTGGCCGGCA AATAGITCGC GGATTACCTG ATGA&CGTAG CGCGCC6GCG 

12101 CGTGAACCCC GAGTATTTCA CCAATGCCAT CTT6AACCCG CACTGGCTAC 
GCACTTGGGG CTCATAAAGT GGTTACGGTA GAACTTGGGC GTGACCGAT6 

12151 CGCXrCCCTGG TTTCTACACC GGGGGATTCG AGGTGCCCGA GGGTAACGAT 
GCGGGGGACC AAAGATGTGG CCCCCTAAGC TCCACGGGCT CCCATTGCTA 

12201 G6ATTCCTCT GGGACGACAT AGACGACAGC GTGTTTTCCC C6CAACCGCA 
CCTAAGGAGA CCCTGCTGTA TCTGCTGTCG CACAAAAGGG GCGTTGGCGT 
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12251 

12301 

12351 

12401 

12451 

12501 

12551 

12601 

12651 

12701 

12751 

12801 

12851 

12901 

12551 

13001 

13051 

13101 

13151 



GkCCCTGCTA GmffTGCIAC AGCGCGkSCA GGCAGAGa?cnXOCTQaii&A 
CTGGG^^CG^T dWACGTTG 7CGCGCTCGT CCQTCTCCGC CGCGmJvL 1 

AGGMAGCTT CCGCA6GCCA AGCAGCTTGT CCGATCTA66 CX3CTGCGGCC 
TCCTTTCGAA GGCGTCCGGT TCGKGMUCk GGCTAGA7CC GCGACGCC6G 

CCGCGGTCAG AT0CTAGTA6 CCCATTTCCA AGCTTGATAG (3GTCTCTTAC 
GGCGCCAGTC TACGATCATC GGGTAAAGGT TC6AACTATC CCAGAffl^TG 

CAGCACTCGC ACCACCCGCC CGCGCCTGCT 66GCGAG6AG GA6TACCTAA 
GTCGTGAGCG T0G76Q3C6G_ GCGCGGU^OS^ CO^GCTCCTC CTCAT^^ 

ACAACTCGCT GCTGCAGCC6 CAGCGCGAAA AAAACCTGCC TCCGGCATTT 
TGTTGAGCGA CGftCGTCGGC GTCGCGCTTT TTTTGGACGG AGGCCGTAAA 

CCCAACAACG 6GATAGAGA6 CCTAGTGGAC AAGATGAGTA GATGGAAGAC 
GGGTTGTTGC CCTATCTCTC GGATCACCTC TTCTACTCAT CTACCTTCTG 

GTACGCGCAG GAGCACAG66 ACG76CCAGG CCCGCGCCCG CXTCACCCGTC 
CATGCGCGTC CTCGTQTCCC TOCACGGTCC GGGCGCGGGC GGGTGGGCAG 

GTCAAAGGCA CGACCGTCAG CGGGGTCTGG TGTGGG^ikGGA CGATGACTCG 
CAGTTTCCGT GCTGGCAGTC GCCCCAGACC ACACCCTCCT GCTACTGAGC 

6CAGACGACA 6CAGCGTCCT GGATrTGGGA GGGAGTGGCA ACCCGTTTGC 
CGTCTGCTGT CGTCGCAGGA CCTAAACCCT CCCTCACCGT TGGGCAAACG 

GCACCTTCGC CCCAGSCTGG GGAGAATGTT TTAA^UUiAAA AAAAAGCAT6 
CGTGGAAGCG G3GTCCGACC CCTCTTACAA AATTTTTTTT TTTTTCGTAC 

ATGCAftAATA AAAAACTCAC CAAGGCCATG 6CACCGAGCG TT6GTTTTCT 
TACCTTTTAT TTTTTGAGTG GTTCCG3TAC -CGTGGCTCGC AACCAAAAGA 

TGTATTCCCC TTAGTATGCG GCGCGCGGCG ATGTATGAGG AAGGTCCTCC 
ACATAAG6GG AATCATAC6C CGCGCGCCGC TACATACTCC TTCCAGGAG6 

TCCCTCCTAC GAGAGTGTGG TGAGCGCGGC GCCAGTGGCG GCG6CGCTG6 
AGGGAGGATG C7CTCACACC ACTCGCGCC6 CGGTCACCGC CGCCGCGACC 

GTTCTCCCTT CC3ATGCTCCC CTGCACCXTGC CGTTTGTGCC TCCGCGGTAC 
CAAGA66GAA GCTACGAG6G GACCTGGGC6 GCAA^kCACGG AG6CGCCATG 

CTGCGGCCTA CCGGGGGGAG AAACAGCATC CGTTACTCTG AGTTGGCACC 
GACGCCGGAT GSCCCCCCTC TTT6TC6TAG GCAATGAGAC TCAACCGTGG 

CCTATTCGAC ACCACCCGTG 7GTACCTGGT G6ACAACAAG TCAACGGATG 
GGATAAGCT6 TGGTGGGCAC ACATG6MXA CCTGTTGTTC AGTTGCCTAC 

TGGCATCCCT GAACTACCAG AACGACCACA GCAACTTTCT 6ACCACGGTC 
ACCGTA6GGA CTTGATGGTC TTGCTGGTGT CGTOHSAAAGA CT6GT6CCAG 

ATTCAAAACA ATGACTACAG CCCGGGGGAG GCAAGCACAC A6ACCATCAA, 
TAAGTTTTGT TftCTG&TGTC OGGCCCXXrTC CGTTCGTGTG TCTQGTAGTT 

TCTT6ACGAC CGGTCGCACT GGGGCGGC6A CCTGAAAACC ATCCTGCATA 
AGAACTGCTG GCCAGCGTGA CCCCGCCGCT GGACTTTTGG TAGGACGTAT 
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13201 CCAACATGCC AaK^TGAAC GAGTTCATGT TTACCAATAA ^tyilTASQieG 
GGTTGTACGG iflBfCACTTG CTCAAGTAO^ AATCCTTATT CAAATTfl^C 

13251 CGGGTGATGG TGTCGCGCTT GCCTACTAA5 GACAATCAGG TGGAGCTGAA 
GCCCACTACC ACAGCGCGAA CGGATGATTC CTGTTAGTCC ACCTCGACTT 

13301 ATACGAGTGG GTGGAGTTCA CGCTGCCCGA GGGCAACTAC TCCGAGACCA 
TATGCTCACC CACCTCAAGT GCGACGGGCT CCCGTTGATG AGGCTCTGGT 

13351 TGACCATAGA CCTTATGAAC AACGCGATCG TGGAGCACTA CTTCAAAGTG 
. ACTGGTATCT GGAATACTT6 TTGCGCTASC ACCTCGTGAT GAACTTTCAC 

13401 QGCAGACAGA AC6GGGTTCT GGAAAGCGAC ATCGGGGTAA AGTTTGACAC 
CCGTCTGTCT T6CCCCAAGA CCTTTCGCTG TA6CCCCATT TCAAACTGTG 

13451 CCGCAACTTC AGACTGGGGT TTGAOCCCGT CACTGGTCTT GTCATGCCTG 
GCCCTTGAAG TCT6ACCCCA AACTQGGGCA GTGACCAGAA CAGTACGGAC 

13501 G6GTATATAC AAACGAAGCC TTCCATCCAG ACATCATTTT GCTGCCAGGA 
CCCATATATG TTTGCTTC6G AAGGTAGGTC TGTAGTAAAA CGACGGTCCT 

13551 TGCGGGGTGG ACTTCACCCA CAGCCGCCTG AGCAACTTGT TGGGCATCCG 
ACGCCCCACC TGAAGTGGGT GTCGGCGGAC TCGTTGAACA ACCC6TAGGC 

13601 CAAGCGGCAA CCCTTCCAGG AGGGCTTTAG GATCACCTAC GATGATCTGG 
GTTCGCCGTT GGGAAGGTCC TCCCGAAATC CTACTGGATC CTACTAGACC 

13651 AGGGTGGTAA CATTCCCGCA CTGTT6GATG TGGACGCCTA CCAGGCGAGC 
TCCCACCATT GTAAGGGCGT GACAACCTAC ACCTGCGGAT GGTCCGCTCG 

13701 TTGAAAGATG ACACCGAACA GGGCGG6GGT GGCGCAGGCG GCAGCAACAG 
AACTTTCTAC TGTGGCTTGT CCCGCCCCCA CCGCGTCCGC CGTCGTTGTC 

ISTSl CAGTGGCAGC GGCGCGGAAG AGAACTCCAA CGCGGCAGCC SCCGCAATGC 
GTCACCGTCG CCGCGCCTTC TCTTGAGGTT GCGCCGTCGG CGCCGTTACG 

13801 AGCCGGTG6A G6ACATGAAC GATCATGCCA TTC6CGGCGA CACCTTTGCC 
TCGGCCACCT CCTGTACTTG CTAGTACGGT AAGCGCCGCT GTGGAAACGG 

13851 ACACGGGCTG AGGA6AAGCG CGCTGAGGCC GAAGCAGCGG CCGAAGCTGC 
TGTGCCCGAC TCCTCTTCCC GCGACTCCGG CTTCGTCGCC QGCTTCGACG 

13901 CGCCCCCGCT GCGCAACCCG AGGTCGAGAA GCCTCAGAAG AAACCGGTGA 
GCGGGGGCGA CGCGTTG6GC TCCAGCTCTT CGGAGTCTTC TTTGGCCACT 

13951 TCAAACCCCT 6ACAGAGGAC AGCAAGAAAC GCAGTTACAA CCTAATAAGC 
AGTTTGGGGA CTGTCTCCTG TCGTTCrTTG CGTCAATGTT GGATTATTCG 

14001 AATGACAGCA CCTTCACCCA CTACCGCA6C TOGTACCTTG CATACAACTA 
TTACTGTCGT GGAAGTGGGT CATGGCGTCG ACCATG6AAC GTATGTTGAT 

14051 CGGCiSACCCT CAGACCGGAA TCCGCTCATG GACCCTQCTT TGCACTCCTG 
GCCGCTGGGA 6TCTGGCCTT AGGCQAGTAC CTGGGACGAA ACGTGAGGAC 

14101 ACGTAACCTG CG6CTCGGAG CAGGTCTACT GGTCGTTGCC AGACATGATG 
TGCATTGGAC GOTGAGCCTC GTCCAGATGA CCAGCAACGG TCTGTACTAC 




110/144 



wo 02/22080 



PCT/USO 1/28861 



14151 

142D1 

14251 

14301 

14351 

14401 

14451 

14501 

14551 

14602 

14651 

14701 

14751 

14601 

14851 

14901 

14951 

15001 

15051 



CAAGACCCCG i,« iiCCG tTCCftCGCGC CAiSATCAGCA ACTTTCl^Br 
GTTCT6GGGC AdT^^GAAGGC GAGGTGCGCG GTCTAGTCGT TGAAAG^CEa 

GGTGGGCGCC GftGCTGTTGC CCGTGCACTC CAAGAGCTTC T.«^CAACGACC 
CCACCCGCGG CTC6ACAACG GGCACGTGA6 GTTCTCGAAG ATGT7GCT6G 

AGGCCGTCTA CTCCCAACTC ATCCGCCA6T TTACCTCTCT G.^CCCACGTG 
TCCGGCAGAT GAGGGTTGAC TAGGCGGTCA AATGGAGAGA CTGGGTGCAC 

TTCAATCGCT ITCCCGAGAA CCAGATTTTG GCGCGCCCGC CAGCCCCCAC 
AAGTTAGCGA AAGGGCTCTT GGTCTAAAAC CGCGCGGGCG GICOGGGGTG 

CATCACCACC GTCAGT6AAA ACGTTCCT6C TCTCACAGAT Cf^CGGGf^GC 
GTAGT6GTG6 CAGTCACTTT TGCAAGGACG AGAGTGTCTA 6T6CCCTGCG 

TACCGCTOCG CAACAGCATC GGA0GA6TCC AGCGAGTGAC CATTACTGAC 
ATG6CGACGC GTTGTCGTAG CCTCCTCASQ TC6CTCACTG GTAATGACTC 

6CCASACGCC GCACCTGCCC CTACG7TTAC AAGGCCCT6G GCATA6TCTC 
CGGTCTGCG6 CGT6GACGGG GATGCAAATG TTCCOGGACC CGTATCAGAG 

GCCGCGCGTC CTATCGAGCC GCACTTTTTG AGCAAGCATG TCCATCCTTA 
CGGCGCGCAG GATAGCTCGG CGTGAAAAAC TCGTTCGTAC AGGTAGGAAT 

TATCGCCCAG CAATAACACA GGCTGGGQCC TGC6CTTCCC AAGCAAGATG 
ATAGCGGGTC C3TC*ATTGTGT CCGACCCCGG ACGCGAAGGG TTCGTTCTAC 

TTTGGCGGGG CCAAGAAGCG CTCCGACCAA CACCCACTGC 0CGTGC6CGG 
AAACCGCCCC GGTTCTTCGC GAGGCTGGTT QTGGGTCACG CGCACGCGCC 

GCACTACCGC GCGCCCTGGG GCGCGCACAA ACGCSGCCGC ACTGGGCGCA 
CGTGATGGCG CGCGGGACCC CGCGCGTGTT TGCGCCGGCC TGACCCCCGT 

CCACCGTCGA TGACGCCATC GACGCG6TGG TGGAOGAGGC GCGCAACTAC 
GGTGGCAGCT ACTGCGGTAG CTGCGCCACC ACCTCCTCC6 CX3CGTTGATG 

ACGCrCACGC CGCCACCAGT GTCCACAGTG GACGCGGCCA TTCAGACCGT 
TGCG66TGCG GC6GTGGTCA CAGGTGTCAC CTGCGCCGGT AAGTCTGGCA 

GGTGCGC6GA 0CCCG0C6CT ATSC7AAAAT GAAGAGACOG OGGAGGCGCG 
CCACGCGCCT CGGGCCGCGA TACGATTTTA CTTCTCTGCC QCCTCCGCGC 

TAGCACGTCG CCACCGCCGC CGACCCG-XA CTGCCGCCCA ACGCGCGGCG 
ATCGTGCAGC GGTG6CGGC6 GCTGGGCOGT GACGGCGG6T TGCGCGCCGC 

GCGGCCCTGC TTAACCGCGC ACGTCGCACC 6GCCGACGGG CGGCCATGCG 
CGCCG6GAC6 AAITOGCGCG TOCAGCGTCG CCGGCTGCCC GCC6GTACGC 

CGCCCCTCGA AGGCTGGCCG CGGGTArXGT CACTGTGCCC CCCAGGTCCA 
CCGGCGAGCT TCCGACCGGC 6CCCATAACA 6TGACACG66 G6GTCCAGGT 

GGCGACGAGC GGCCXSCCGCA GCAGCCGCGG CCATTAGTGC TATGACTCAG 
CCGCTGCTCG CCGGCX^GCGT CGTCGGCGCC GGTTATCACG ATACTGAGTC 

GGTCGCAGGG GCAACGTGTA TT6GGTGCGC GACTCGGTTA GCGGCCTGCG 
CCAGCGTCCC CGTTGCACAT AACCCACGCG CTGAGCCAAT CGCCGGACGC 
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15101 CGTGCCXrGTG Cfl^CCCGCC CCCCGCGCAA CTAGATTGCftT^GM A J i—ii 
GCACGGGCAC cWibGGCGG GGGGCQCGTT C3ATCTAACGT TCUiilUA 

15151 ACTTAGACTC GTACTGTTGT ATGTATCCAG CGGCGGCGGC GCGCAACGAA 
TGAATCTGAG CATGACAACA TACATAGGTC GCCGCCGCCG CGCGTTGCTT 



15201 GCTAT5TCCA ADCGCAAAAT CAAAGAAGAG ATGCTCCAGG TCATCCCCCC 
CGATACAGGT rCOCGTTTTA GTTTCTTCTC TACGAOQTCC AOTAOCGCGG 

13251 G6A6ATCTAT GCSCCCCCCGA AGAAGGAA6A GCAGGATTAC AA63CCC6AA 
CCTCTAGATA COGGGGGGCT TCTTCCTTCr CGTCCTAATG TTCSOGOCTT 

15301 A6CTAAA5CG GGTCAAAAA6 AAAAAGAAAG ATGATGATGA TGAACTTGAC 
TC6ATTTCGC CCA6TTTTTC TTTTTCrTTC TACTACTACT ACTTGAACTG 

15351 6AC6AC3GTG6 AACTGCTGCA CGCrACCOCG CCCAGGC6AC 6GGTACAGT3 
CTGCTCCACC TTGAC6ACGT GCGATG0C6C GGGTCCGCTG CCCATGTCAC 

15401 6AAAGGTCGA CGCGTAAAAC GTGTTTTGCG ACCCGGCACC ACCGTAGTCT 
CTTTCCAOCT GCGCATTTT6 CACAAAAC6C TCGGCCGTGG TGGCATCAGA 

15451 TTACGCCCGG TOAGCGCTCC ACCCGCACCT ACAAGCGCGT GTATGATGAG 
AAT6CGGGCC ACTCGCGAGG TGGGCGTGGA TGTTCGCGCA CATACTACTC 

15501 GTGTACCCCG ACGAGC5ACCT GCTTGAGCAG GCCAACGAGC GCCTCGGGGA 
CACATOCCGC TGCTCCTGGA CGAACTCGTC CGGTTGCTCG CGGAGCCCCT 

15551 GTTTGCCTAC GGAAAGCGGC ATAAGGACAT GCTGGCGTTG CCGCTGGACG 
CAAACGGATC CCTTrCCCCG TATTCCTGTA CGACCGCAAC GGCGACCTGC 

15601 AGGGCAACCC AACACCTAGC CTAAAGCCCG TAACACTGCA GCAGGTGCTG 
TCCCGTTGGG TTGTGGATCG GATTTCGGGC ATTGT6ACGT CGTCCACGAC 

15551 CCCGCGCTT6 CACCGTCCGA AGAAAAGCQC GGCCTAAAGC GCGAGTCTGC 
GGGCGCGAAC GTCGCAOGCT TCTTTTCGCX5 CCGGATTTCG C9CTCA6ACC 

15701 TGACTTGGCA CCCACCGTCC ACCTGATGGT ACCCAAGCGC CAGCGACTQG 
ACTGAACCGT GOGTGGCACG TCGACTACCA TCGGTTCGCG 6TC6CTGACC 

15751 AA6ATGTCTT GGAAAAAATG ACCGTGGAftC CTGGGCTGGA 6CCCGAGGTC 
TTCTACAGAA CCTTTTTTAC TCCCACCTTC CACCCGACCT CiSGGCTCCAG 

15B01 CGCGTGCGGC CAATCAAGCA GGTGGCGCCG GGACTGGGCG T3CAGACCGT 
GCGCACGCCG GTTAGTTCGT CCACCGCGGC CCTGACCCGC ACGTCTGGCA 

15B51 GGACGTTCAG ATACCCACTA CCAGTAGCAC CAGTATTGCC ACCGCCACAG 
CCT6CAAG<rC TA?6GGTGAT GGTCATCGTG GTCAIAACGG TQGCGGTGTC 

15901 AGGGCATGGA GACACAAACG TCCCCCGITC CCTCAGCCGT CCCGGATGCC 
TCCCGTACCT CT6TGTTTGC AGGGGCCAAC 06AGTCGCCA CCGCCTACGG 

15951 6CGGTGCAG6 CGGTCGCT6C GGCCGCGTCC AAGACCTCTA CGGAGGTGCA 
CGCCACGTCC GCCAGCGACG CCGGCGCAGG TTCTGGAGAT GOCTCCACGT 

16001 AACGGACCCG TGGATGTTTC GCGTTTC^C CCCCCGGCGC CCGCGCCGTT 
TTGOCTGGGC ACCTACAAAG CGCAAAGTCG GGGG3CCGCG GGCGCGGCAA 
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16051 
16101 
16151 
16201 
16251 
16301 
16351 
16401 
16451 
16501 
16551 
16601 
16651 
15701 
16751 
16801 
16851 
16901 
16951 



CGAGG7AGTA CBirGCCGCC AGCGCOCTAC TCCCCGAATA TGCCCTfl^TT 
GCTCCTTCAT GCCGCGGCGG TCGCGCGATG ACGGGCTTAT ACGGGATGTA 

CCTTCCATTG CGCCTACCCC CGGCTATCGT GGCTACACCT ACCGCCCCAG 
GGAAGGTAAC GCGGATGGGG GCCGATAGCA CCGATGTGGA T66CGGGGTC 

AAGACGAGCA ACTACOCGAC GCCGAACCAC CACTGGAACC CGCCGCCGCC 
TTCTGCTCGT TGATGGGCTG CGGCTTGGTG GT6ACCTTGG GCGGCGGCGG 

6TCGCCGTCG CCAGCCCGTG CTGGCCCCGA TTTCCGTGCG CAGGGTOGCT 
CAGCGGCAGC GGTCGlSGCAC GACCGGGG^ AAA6GCACGC GTCCCACCGA 

CGCGAAGGAG GCAGGACCCT GGTGCTGCCA ACAGCGCGCT ACCACCCCAG 
GCGCTTCCTC CGTCCTGGGA CCACGACGGT TGTCGCGCGA TGGTGGGGTC 

CATCGTTTAA AAGCCGGTCT TTGTGGTTCT TGCAGATATG GCCCTCACCT 
GTAGCAAATT TTCGGCCAGA AACACCAAGA ACGTCTATAC CGGGAGT6GA 

GCCGCCTCCG TTTCCCGGTG CCG6GATTCC GAGGAAGAAT GCACCGTAGG 
CGGCGGAGGC AAA6GGCCAC GGCCCTAAGG CTCCTTCTTA CGTGGCATCC 

AGGGGCATGG CCGGCCACGG CCTGACGGGC GGCATGCGTC GTGCGCACCA 
TCCCCGTACC GGCCGGTGCC GGACTGCCCG CCGTACGCAG CACGCGTGGT 

GCGGCGGCGG CGCGCGTCGC ACCGTCGCAT GCGC6GCGGT ATCCTGCCCC 
GGCCGCCGCC GCGCGCAGCG TGGCAGCGTA CGCGCCGCCA TAGGACGGGG 

TCCTTATTCC ACTGATCGCC GCGGCGATTG GCGCCGTGCC CGGAATTGCA 
AGGAATAAGG TGACTAGCGG CGCCGCTAAC CGCGGCACGG GCCTTAACGT 

TCCGTGGCCT T6CAGGCGCA GA6ACACTGA . TTAAAAACAA GTTGCATGTG 
AGCCACCCGA ACGICCGCGT CTCTGTGACT AATTTTTGTT CAACGTACAC 

GAAAAATCAA AATAAAAAGT CTGGACTCTC ACGCTCGCTT GGTCCTGTAA 
CTTTTTAGTT TTATTTTTCA GACCTGA6AG TGCGAGCGAA CCAGGACATT 

CTATTTTGTA GAATGGAAGA CATCAACTTT GCGTCTCTGG CCCCGCGACA 
GATAAAACAT CTTACCTTCT CTAGTTGAAA CGCAGAGACC GGGGCGCTGT 

CGGCTGGCGC CCGTTCATGG GAAACTGGCA AGATATCGGC ACCAGCAATA 
GCCGAGCGCG GGCAAGTACC CTTTGACCGT TCTATAGCCG TGGTCGTTAT 

TGAGCGGTGG CGCCTTCAOC TGGGGCTCGC TGTGQAGCGG CATTAAAAAT 
ACTCGCCACC GCGGAAGTCG ACCCCGAGCG ACACCTCGCC GTAATTTTTA 

TTCGGTTCCA CC6TTAAGAA CTATCGCAGC AAGGCCTGGA ACAGCAGCAC 
AAGCCAAGGT GGCAATTCTT GATACCGTCG TTCCOGACCT TGTCGTCGTG 

AGGCCA6ATG CT6A6GGATA AGTTGAAAGA GCAAAATTTC CAACAAAAGG 
TCCGGTCTAC GACTCCCTAT TCAACTTTCT CGTTTTAAAG GTTGTTTTCC 

TGGTAGATGG CCTGGCCTCT GGCATTAGCG GG6TGGTGGA CCT6GCCAAC 
ACCATCTACC GGACCGGAGA CC67AATCGC CCCACCACCT GGACCGGTTG 

CAGGCAGTGC AAAATAA6AT TAACAGTAA6 CTTGATCCCC GCCCTCCCGT 
GTCCGTCT^ TTTTATTCTA ATTCTCATTC GAACTAGGGG C6GGAGGGCA 
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17001 
17051 
17101 
17151 
17201 
17251 
17301 
17351 
17401 
17451 
17501 
17551 
17601 
17S51 
17701 
17751 
17B01 
17B51 
17901 



AGAGGAGCCT O^CGGCCG" TGGAGACAGT GTCTCCAGAC 'CGGCG1<aBB3 

TCTCCTCGGA gSIEgccggc ACCTCTGTCA cagaggtctc cccoca<S5c 

AAAAGCGTCC GCGCCXTCGAC AGGGAAGAAA CTCTGGTGAC GCAAATAGAC 
TTTTCGCAGG CGCGGGGCTG TCCCTTCTTT GAGACCACTG CGTTTATCT3 

GAGCCTCCCT CGTACGAGGA G6CACTAAAG CAA6GCCTGC CCACCACCCG 
CTCGGAGGGA QCATGCTCCT CCGTGATTTC GTTCCGGACG GGTG6TGGGC 

TCCCATCGCG CCCATGGCTA CCGGAGTGCT GGGCCAGCAC ACACCCGTAA 
AGGGTAGCGC GGGTACCGAT GGCCTCAC6A CCCG6TCGTG TGTOGQCATT 

CGCTGGACCT GCCTCCCCCC GCCGACACCC AGCACAAACC TGTOCTCCCA 
GCGACCTGGA 06GAGGQGGG CGGCTGTG6G TCGTCTTTGG ACACGAC6GT 

GGOCCGACCG OCGTTGTTGT AACCCGTCCT AGCCGCGCGT CCCTGCGCCG 
CCGGGCTGGC OGCAACAACA TTGGGCAGGA TCGOCGCGCA GGGACGCGGC 

CGCCGCCAGC OGTCCGCGAT CGTTGCGCCC CGTAGCCAGT GGCAACTGGC 
GCGGCGGTCG CXAGGCGCTA GCAACGCC6G GCATCGGTCA CCGTTGACCG 

AAASCACACT GAACA6CATC GTGGGTCTGG GGGT6CAATC CCTGAAGCGC 
TTTCGTGTGA CTTGrCGTAG CACCCAGACC CCCACGTTA6 GGACTTCGCG 

CGACGATGCT TCTGATAGCT AACGTGTCGT ATCTGTGTCA TCTATGCCTC 
GCTGCTAC6A AGACTATCGA TTGCACA6CA TACACACAGT ACATACGCAG 

CATGTCGCCG CCAGAGGAGC TGCTGAGCCG CCGCGCGCCC GCTTTCCAAG 
GTACAGC6GC GGTCTCCTCG ACGACTC6GC GGCGCGCGGG C3AAAGGTTC 

ATGGCTACCC CTTCGATGAT GCCGCAGTGG TCTTACATGC ACATCTCGGG 
TACCGATCCC GAACCTACTA CGCCGTCACC AGAATGTACG T3TAGAGCCC 

CCAGGACGCC TCCGAGTACC TGAGCCCCG6 6CTG6TGCAG TTTGCCCGCG 
GGTCCTGCGG AGCCTCATGG ACTCGGGGCC CGACCACGTC AAACGGGCGC 

CCACCGAGAC GTACTTCAGC CTGAATAACA AGTTTAGAAA CCCCACGGTG 
GGTGGCTCTG CA7GAAGTCG GACTTATTGT TCAAATCTTT G3GGTGCCAC 

GCGCCTACCC ACCACGTGAC CACAGACC6C TCCCAGCCTT TSACGCTGCG 
CGCGGATGCO T0CT3CACTG GTGTCTOGCC A60GTC0CAA ACTGCGACGC 

6TTCATCCCT GTGGACCGTG AGGATACTCC GTACTC6TAC AAGGCGCG6T 
CAAGTAGGGA CACCTGGCAC TCCTATGACG CATGAGCATG TTCCGCGCCA 

TCACCCTAGC TG?GGGTGAT AACCGTGTGC TGGACATGGC TTCCACGTAC 
AGT6GGATCG ACACCCACTA TTGGCACACG ACCT6TACCG AAGGT6CATC 

TTTGACATCC GCGGCGTGCT 6GACA0GG9C CCTACTTTTA AGCCCTACTC 
AARCTGTAGG CGCCQCAOGA CCTGTCCCCG GGATGAAAAT TCGGGA3X3AG 

TGGCACTGCC TACAACGCCC TGGCTCCCAA GGGTGCCCCA AATCCTTGCG 
ACCGTGACGG AT6TTGCGGG ACCGAGGGTT CCCACGGGGT- TTAGGAACGC 

AATGGGATGA AGCTGCTACT GCTCTTGAAA TAAACCTAGA AGAAGAGGAC 
TTACCCTACT TC6ACGATGA CGAGAACTTT ATTTGGATCT ICTTCTCCTG 
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17951 GATGACAACG AJ^RTGAAGT AGACGAGCAA GCTGAGCAGC AAAAAAfl^^ 
CTACT6TTGC TTCTGCTTCA TCTGCTCGTT CGACTCGTCG TTTTTTGAGT 

18001 CGTATTTGGG CAGGCGCCTT ATTCTGGTAT AAATATTACA AAGGAGGGTA 
GCATAAACCC GTCCGCGGAA TAAGACCATA TTTATAATGT TTCCTCCCAT 

18051 TTCAAATAGG TCTCGAAGGT CAAACACCTA AATATGCCGA TAAAACATTT 
AAGTTTATCC ACAGCTTCCA GTTTGTGGAT TTATACGGOT ATTTTGTAAA 

leiOl CAACCTGAAC CTCAAATAGG AGAATCTCAC T6GTACGAAA CAGAAATTAA 
GTTGGACTTG GAGTTTATCC TCTTAGAGTC ACCATGCTTT GTCTTTAATT 

18151 TCATGCAGCT GGGAGAGTCC TAAAAAAGAC TACCCCAATG AAACCAT6TT 
AGTACGTCGA CCCTCTCAGG ATTTTTTCTG ATGGGGTTAC TTTGGTACAA 

18201 ACGGTTCATA TGCAAAACCC ACAAATGAAA ATGGAGGGCA AGGCATTCTT 
TGCCAAGTAT ACGTTTTGGG TGTTTACTTT TACCTCCCGT TCCGTAAGAA 

18251 GTAAAGCAAC AAAATGGAAA GCTAGAAAGT CAAGTGGAAA TGCAATTTTT 
CATTTCGTTG TTTTACCTTT CGATCTTTCA GTTCACCTTT ACGTTAAAAA 

18301 CTCAACTACT GAGGCAGCCG CAGGCAATGG TGATAACTTG AXTCCTAAAG 
GAGTTGATGA CTCCGTCGGC GTCCGTTACC ACTATTGAAC TGAGGATTTC 

18351 TGGTATTGTA CAGTGAAGAT GTAGATATAG AAACCCCA6A CACTCATATT 
ACCATAACAT GTCACTTCTA CATCTATATC TTTGGGGTCT GTGAGTATAA 

18401 TCTTACATGC CCACTATTAA GGAAGGTAAC TCACGAGAAC TAATGGGCCA 
AGAArGTACG 6GTGATAATT CCTTCCATTG AGTGCTCTTG ATTACCCGGT 

18451 ACAATCTATG CCCAACAGGC CTAATOACAT. TGCTTTTAGG 

TGTTAGATAC GGGTTGTCCG GATTAATGTA ACGAAAATCC CTGTTAAAAT 

18501 TTGGTCTAAT GTATTACAAC AGCACGGGTA ATATGGGTGT TCTGGCGGGC 
AACCAGATTA CATAATGTTG TCGTGCCCAT TATACCCACA AGACCGCCCG 

18551 CAAGCATCGC AGTTGAATGC TGTTGTAGAT TTGCAAGACA GAAACACAGA 
GTTCGTAGCG TCAACTTACG ACAACATCTA AACGTTCTGT CTTTGTGTCT 

18601 GCTTTCATAC CAGCTTTTGC TTGATTC'IZAT TGGTGATAGA ACCAGGTACT 
CGAAAGTAT6 GTCGAAAACG AACTAAG5TA ACCACTATCT TGGTCCATGA 

18651 TTTCTATGTG GAATGAGGCT GTTGACAGCT ATGATCCAGA TGTTAGAATT 
AAAGATACAC CTTAGTCCGA CAACT6TCGA TACTAGGTCT ACAATCTTAA 

18701 ATTGAAAATC ATGGAACTGA AGATGAACTT CCAAATTACT GCTTTCCACT 
TAACTTTTAG TACCTTGACT TCTACTT3AA GGTTTAATGA CGAAAGGTGA 

18751 GGGAGGTGTG ATTAATACAG AGACTCTTAC CAAGGTAAAA CCTAAAACAG 
CCCTCCACAC TAATTATGTC TCTGA6AATG GTTCCATTTT GGATTTTGTC 

18B01 GTCAGGAAAA TGGATGGGAA AAAGATCCTA CAGAATTTTC AGATAAAAAT 
CAGTCCTTTT ACCIACCCTT TTTCTACGAT GTCTTAAAAG ICTATTTTTA 

18B51 GAAATAAGAG TTGGAAATAA TTTTGCCATG GAAATCAATC TAAATGCCAA 
CTTTATTCTC AACCTTTATT AAAACGGTAC CTTTAGTTAG ATTTACGGTT 
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XBBOl CCTGTGGAGA AI^TCCTGT ACTCCAACAT AGCGCTGTAT TTCC'CC^BPA 
GGACACCICT TTAAAGGACA TGAG6TTGTA TCGCX^ACATA AACG6GCTGT 

18951 AGCTAAAGTA CAGTCCTTCC AACGTAAAAA TTTCTGATAA CCXAAACACC 
TCGATTTCAT GTCAGGAAGG TTGCATTTTT AAAGACTATT GQGTTTGTGG 

19001 TACGACTACA TGAACAAGCG AGTGGTGGCT CCCGGC3CTAG TGGACTGCTA 
ATGCTGATGT ACTTGTTCGC TCACCACCGA GGGCCCGATC ACXTTGACGAT 

19051 CATTAACCTT GGAGCACGCT GGTCCCTTGA CTATATGGAC AACGTCAACC 
GTAATTGGAA CCTCGTGCGA CCA6GGAACT GATATACCTG TTGCAGTTGG 

19101 CATTTAACCA CCACCGCAAT GCTGGCCTGC 6CTACCGCTC AATGTTGCTG 
GTAAATTGG7 GGTGGCGTTA CGACCGGACG C6ATGGCGAG TTACAACGAC 

19151 GGdAATGGTC GCTATGTGCC CTTCCACATC CACGTCCCTC AGAAGTTCTT 
CC6TTACCAG CGATACACGG GAAGGTGTAG GTCCACGGAG TCTTCAAGAA 

19201 TGCCATTAAA AACCTCCTTC TCCTGCCGQG CTCATACACC TACGAGTGGA 
ACGGTAATTT TTGGAGGAA6 AGGACGGCCC GAGTATGTGG ATGCTCACCT 

19251 ACTTCAGGAA GGATGTTAAC ATGGTTCTGC AGAGCTCCCT AGGAAATGAC 
TGAAGTCCTT CCTACAATTG TACCAAGACG TCTCGAGGGA TCCTTTACTG 

19301 CTAAGGGTTG ACGGAGCCAG CATTAAGTTT GATAGCATTT GCCTTTACGC 
GATTCCCAAC TGCCTCGGTC GTAATTCAAA CTATCGTAAA CGGAAATGCG 

19351 CACCTTCTTC CCCATGGCCC ACAACACCGC CTCCACGCTT 3AGGCCATGC 
GTGGAAGAAG GGGTACCGGG TGTTGTGGCG GAGGTGCGAA CTCCGGTACG 

19401 TTAGAAACGA CACCAACGAC CAGTCCTTTA ACGACTATCT ll^rCCGCCGCC 
AATCTTTGCT GTG3TTGCTG GTCAGGAAAT TGCTGATAGA 3AGGCGGCGG 

19451 AACATGCTCT acc::tatacc CCCCAACGCT ACCAACGTGC CCATATCCAT 
TT6TACGAGA TGG5ATATGG GC6GTT6CGA TGGTTGCACG GGTATAGGTA 

19501 CCCCTCCCGC AACTGGGCGG CTTTCC6CGG CTGGGCCTTC ACGCGCCTTA 
GGGGAGGGCG TTGACCCGCC GAAAGGCGCC GACCCGGAAG TGC6CGGAAT 

19551 AGACTAAGGA AACCCCATCA CTGGGCTCGC GCTACGACCC TTATTACACC 
TCTGATTCCT TTGGGGTAGT GACCCGA3CC CGATGCTGGG AATAATGTGG 

19601 TACTCTGGCT CTATACCCTA CCTAGATGGA ACCTTTTACC TCAACCACAC 
ATGAGACCGA GATAT6GGAT GGATCTACCT TGGAAAATGG AGTTGGTGTG 

19651 CTTTAAGAAG GTGGCCATTA CCTTTGACTC TTCTGTCAGC TGGCCTGGCA 
GAAATTCTTC CACCG6TAAT GGAAACTGAG AAGACAGTC6 ACC6GACCG7 

19701 ATGACCGCCT GCTTACCCCC AACGA6TTTG AAATTAAGCG CTCAQTTGAC 
TACTGGCGGA CGAATG6GG6 TTGCTCAAAC TTTAATTCGC GA6TCAACTG 

19751 GGGGAGGGTT ACAAC6TTGC CCAGTGTAAC ATGACCAAAG ACTGOTTCCT 
CCCCTCCCAA TGTTGCyUM:e GGTCACAIT6 TACTGGTTTC TGACCAAGGA 

19801 GGTACAAATG CTAGCTAACT ATAACATTGG CTACCAGGGC TTCTATATCC 
CCATGTTTAC GATCGATTGA TATTGTAACC GATGGTCCCG AAGATATAGG 
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19851 CAGAGAGCTA Ci^tSACCGC • ATGTACTCCT TCTTTAGAAA CTTCCA«|[pt: 
GTCTCTCGAT GTTCCTGGC6 TACATG^GGA AGAAATCTTT GAAGG7CG0G 

19901 ATX5AGCCGTC AGGT6GT6GA TGATACTAAA TACAAGGACT ACCAACAGGT 
TACTCGGCAG TCCACCACCT ACTATGATTT ATGTTCCT6A TG6TTGTCCA 

19951 GGGCATCCTA CACCAACACA ACAACTCTGG ATTTGTTGGC TACCTTGCCC 
CCCGTAGSAT GTGGTT6TGT TGTTGAGACC 7AAACAACCG ATGGAACG6G 

20001 CCACCATGC6 CGAAGGACAG GCCTACCCTG CTAACTTCCC CTATCCGCTT 
GGTGGTACGC GCTTCCTG7C CGGATGGGAC GATTGAAGGG GATAGGCGAA 

20051 ATAGGCAAGA CCGCAGTTGA CAGCATTACC CAGAAAAAGT TTCTTTGCGA 
TATCCGTTCT GGC6TCAACT GTCGTAATG6 GTCTTTTTCA AAGAAACGCT 

20101 TCGCACCCTT TG6CGCATCC CATTCTCCAG TAACTTTAT6 TCCATGG6CG 
AGCGTGGGAA ACCGCGTAG6 GTAAGAGGTC ATT6AAATAC AGGXACCCGC 

20151 CACTCACA6A CCTG6GCCAA AACCTTCTCT ACGCCAACTC CGCCCACGCG 
GTGAGTGTCT GGACCCGGTT TTGGAAGAGA TGCGGTTGAG GCGGGTOCGC 

20201 CTAGACATGA CTTTTGAGGT GGATCCCATG GACGAGCCCA CCCTTCTTTA 
GATCTGTACT GAAAACTCCA CCTAGG6TAC CTGCTCGGGT GGGAAGAAAT 

20251 TGTTTTGTOT GAAGTCTTTG ACGTGGTCCG TGTGCACCAG CCGCACCGCG 
ACAAAACAAA CTTCAGAAAC TGCACCAGGC ACACGTGGTC GGCGTGGCGC 

20301 GCGTCATCGA AACCGTGTAC CTGCGCACGC CCTTCTCGGC CGGCAACGCC 
CGCAGTAGCT TT6GCACATG GACGCGTGCG GGAAGAGCCG GCCGTTGCGG 

20351 ACAACATAAA GAAGCAAGCA ACATCAACAA CAGCTGCCGC CATGGGC7CC 
TGTTGTATTT CTTCGTTCGT TGTAGTTGTT (HtGACGGCG" GTACCCGAGG 

20401 AGTGAGCA6G AACTGAAAGC CATTGTCAAA GATCTTGGTT GTGGGCCATA 
TCACTCGTCC TTGACTTTCG GTAACAGTTT CTAGAACCAA CACCCGGTAT 

20451 TTTTTTGGGC ACTTATGACA AGCGCTTTCC AGGCTTTGTT TCTCCACACA 
AAAAAACCCG TGGATACTGT TCGCGAAAGG TCCGAAACAA AGAGGTGTGT 

20501 A6CTCGCCTG COCCATAGTC AATACGGCCG GTC6CGAGAC T6GGG6C6TA 
TCGAGCGGAC GCGGTATCAG TTATGCCG6C CAGCGCTCTG ACCCCCGCA7 

20551 CACTGGAT6G CCTTTGCXnKS GAACCCGCAC TCAAAAACAT CCTACCTCTT 
GTGACCTACC GGAAACGGAC CTTGGGCGT6 AGTTTTTGTA Ca^TGGAGAA 

20601 TGAGCCCTTT GOCTTTTCTG ACCAGCGACT CAAGCA6GTT TACCAGTTTG 
ACTCGGGAAA CC6AAAA6AC TGGTCGCTGA GTTCGTCCAA ATGGTCAAAC 

20651 AGTACGAGTC ACTCCTGCGC CGXA6CGCCA TTGCTTCTTC CCCCGACCGC 
TCATGCTCAG TGAGGACGCG GCATCGCGGT AACGAAGAAG GGGGCTGGCG 

20701 TGTATAACGC TGGAAAAGTC CACCCAAAGC GTACAGGGGC CCACTCGGC 
ACATATTGCG ACSZTTTTCAG GTGGGOTTCG CATGTCCCCG GGTTGAGCCG 

20751 CGCCTGTGGA CTATTCT6CT GCATGTTTCT GCACGCCTTT GCCAACTGGC 
GCGGACACC7 GATAAGACGA CGTACAAAGA GGTGCGGAAA CGGTTGACCG 
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20801 CCCAAACTCC di^GATCAC AACCCCACCA TGAACCTTAT TACCCG^pS" 
•GGGTTTGAGG'GTACCTAGTG TTGGGGTGGT ACTTGGAATA ATGGCCCCAT 

20851 CCCAACTCCA TGCTCAACAG TCCCCAGGTA CAGCCCACCC TQCGTCGCAA 
GGGTTGAGGT ACGAGTTGTC AGGGGTCCAT GTCGGGTGGG ACGCAGCGTT 

20901 CCAOGAACAG CTCTACAGCT TCCTGGAGCG CCACTCGCCC TACTTCCGCA 
QGTCCTTGTC GAGATGTCGA AGGACCTCGC GGTGAGCGGG ATGAAGGCGT 

20951 GCCACAGTGC GCA6ATTAGG AGCGCCACrT CTTTTTGTCA CTTGAAAAAC 
CGGTGTCACG CGTCTAATCC- TCGCGGTGAA GAAAAACA£3T GAACTTTTTG 

21001 ATGTAAAAAT AATGTACTAG AGACACTTTC AATAAAGGCA AATGCTTTTA 
TACATTTTTA TTACATGATC TCTGTGAAAG TTATTTCCGT TIACGAAAAT 

21051 TTTCTACACT CTCGGGTGAT TATTTACCCC CACCCTTGCC GTCTGCGCCG 
AAACATGTGA GAGCCCACTA ATAAATGGGG GTGGGAACGC CAGACGCGGC 

21101 TTTAA?iAATC AAAGGGGTTC TGCCGCGCAT CGCTATGCGC CACTGGCAGG 
AAATTTTTAG TTTCCCCAAG AC6GCGCGTA GCGATACGCG GTGACCGTCC 

21151 GACACGTTGC GATACTGGTG TTTAGTGCTC CACTTAAACT CAGGCACAAC 
CTGTGCAACG CTATGACCAC AAATCACGAG GTGAATTTGA GTCCGTGTTG 

21201 CATCCGCGGC AGCTCGGTGA AGmTCACT CCACAGGCTG CGCACCATCA 
GTAGGCGCCG TCGAGCCACT TCAAAAGTGA GGTGTCCGAC GCGTG6TAQT 

21251 CCAACGCGTT TAGCAGGTCG GGCGCCGATA TCTTGAAGTC GCAGTTGGGG 
GGTTOCGCAA ATCGTCCAGC CCGCGGCTAT AGAACTTCAG CGTCAACCCC 

21301 CCTCCGCCCT GCGCGCGCGA GTTGCGATAC ACAGGGTTGC AGCACTGGAA 
GGAGGCGGGA CCC3CGCGCT CAACGCTATG TGTCCCAACG TCGTGACCTT 

21351 CACTATCAGC GCCSGGTGGT GCACGCTGGC CA6CACGCTC TTGTCGGAGA- 
GTGATAGTCG C6GCCCACCA CGTGC6ACCG GTCGTGCGAG AACAGCCTCT 

21401 TCAGATCCGC GTCCAGGTCC TCCGCGTTGC TCAGGGCGAA CGGAGTCAAC 
AGTCTAGGCG CAGGTCCAGG AGGCGCAACG AGTCCCGCTT GCCTCAGTTG 

21451 TTTGGTAGCT CCCTTCCCAA AAACGGCGCG TGCCCAGGCT TTGAGTTGCA 
AAACCATCGA CGSAAGGGTT TTTCCCQCGC ACGGGTCCGA AACTCAACGT 

21501 CTC6CACCGT AGTGGCATCA AAAGGTGACC 6TGCCCGGTC TGGGCGTTAG 
GAGCGTGGCA TCACCGTAGT TTTCCACnGG CACGGGCCAG ACCCGCAATC 

21551 GATACAGCGC CTQCATAAAA GCCTTGATCT GCTTAAAAGC CACCTCAGCC 
CTATGTCGCG GACGTATTTT CGGAACTAGA CGAATTTTCG GTGQACTCGG 

21601 TTTGCGCCTT CAGAGAAOAA CATOCCGCAA GACTTGCCGG AAAACTGATT 
AAACGdGGAA GTClCTOCTt GTACGGCGTT CTGAACGGCC TTTTGACTAA 

21651 GGCCGGACAG GCCGCGTCGT OCACGCAGCA CCTTGCGTC6 GTGTTGGAGA 
CCGGCCTCTC CGSCGCAGCA CGTGCGTCiGT GGAACGCAGC CACAACCTCT 

21701 TCTGCACCAC ATTTCGGCCC CACCGGTTCT TCACGATCTT 6GCCTTGCTA 
AGACGTGGTG TAAAGCCGGG GTGGCCAAGA AGTGCTAGAA CCGGAACGAT 
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21751 GACT6CTCCT T^^CGCGCG CTGCCCX3TTT TCGCTCGTCA CAUXXA'^pC 
CTGACGTlGGA AGTCGCGCGC GACGGGCAAA A6CGAGCAGT GTAGQTAAAG 

21801 AATCACGTGC TOCTTATTTA TCATAAT6CT TCCGTGTAGA CACTTAAGCT 
TTAGT6CACG AGGAATAAAT AGTATTACGA AGGCACATCT GTGAATTCGA 

21851 CGCCTTCGAT CTCAGCQCA6 CQGTGCAGCC ACAACGCGCA GCCCGT6GGC 
GCGGAAGCTA GAGTCGCGTC GCXJICGTCGG TGTTGCGCGT CG5GCACCCG 

21901 TCGTGATGCT TGTAGGTCAC CTCTGCAAAC 6ACT6CAGGT ACGCCTGCAG 
AGCACTACGA ACATCCAGTG GAGACGT7TG CTGACGTCCA-TGCGGACGTC 

21951 GAATCGCCCC ATCATCGTCA CAAAGGTCTT GTTGCT6GTG AAGGTCA6CT 
CTIAGCX36(3G TAG^AGCAGT GTTTCCAGAA CAACGACCAC TTCCAGTCGA 

* 

22001 GCAACCCGCC GTCCTCCTCG TTCAGCCAGG TCTTGCATAC 66CCGCCAGA 
CGTTGGGCGC CACGAGGAGC AAGTCGGTCC AGAACGTATG CCGGCGGTCT 

22051 GCTTCCACTT GGTCA6GCA6 TAGTTTGAAG TTCGCCTTTA GATCGTTATC 
CGAAGGTGAA CCAGTCCGTC ATCAAACTTC AA6CGGAAAT CTAGCAATAG 

22101 CACGTGGTAC TTGTCCATCA QCGCGCGCGC AGCCTCCATG CCCTTCTCCC 
6TGCACCATG AACAGGTAGT CGC6CGCGCG TCGGAGGTAC GGGAAGAGGG 

22151 ACGCAGACAC GATCGGCACA CTCAGCGGGT TCATCACCGT AATTTCACTT 
TGCGTCTGTG CTAGCCGT6T GAGTCGCCCA AGTAGTGGCA TTAAAGTGAA 

22201 TCCGCTTCGC TGGGCTCTTC CTCTTCCTCT TGCGTCCGCA TACCACGCGC 
AGGCGAAGCG ACCCGAGAAG GAGAAGGAGA ACGCAGGCGT ATGGTGCGCG 

22251 CACTGGGTCG TCTTCATTCA GCCGCCGCAC TGTGCGCTTA CCTCCTTTGC 
GTGACCCAGC AGAAGTAAGT CGGCGGCGTG ACACGCGAAT GGAGGAAACG 

22301 CATGCTTGAT TAGCACCGGT GGGTTGCTGA AACCCACCAT TTGTAGCGCC 
GTACGAACTA ATCGTGGCCA CCCAACGACT TTGGGTGGTA W^CATCGCGG 

22351 ACATCTTCTC TTTCTTCCTC GCTGTCCACG ATTACCTCTG GTGATGGCGG 
TGTAGAAGAG AAAGAAGGAG CGACAGGTGC TAATGGAGAC CACTACCGCC 

• < 

22401 GCGCTCGGGC TTGGGAGAAG GGCGCTTCTT TTTCTTCTTG GGCGCAATGG 
CGCGAGCCCG AACCCTCTTC CCGCX3AAGAA AAAGAAGAAC CCGCGTTACC 

22451 CCAAATCCGC CGCCGAGGTC GATGGCCGCG GGCTGGGTGT GCGCGGCACC 
GGTTTAGGCG GCGGCTCCAG CTACCGGCGC CCGACCCACA CGCGCCGT6G 

22501 AGCGCGTCTT GTGATGAGTC TTCCTCGTCC TCGGACTCGA TACGCCGCCT 
TCGCGCAGAA CACTACTCAG AA6GAGCAGG AGCCTG^GCT ATGCGGCGGA 

22551 CATCCGCTTT TTTGGGGGCG CCCG6GGAGG CGGCGGCGAC GGGGACGGGG 
GTAGGCGAAA AAACCCCCGC GGGCCCCrcC GCCGCCGCTG CCCCTGCCCC 

22601 ACGACACGTC CTCCATGGTT GGGGGACGTC GCGCGGCACC GCGTCCGCGC 
TGCTGTGCAG GAGGTACCAI^ CCCCCTGCAG CGCGGCGTG6 C6CAGGCGCG 

22651 TCGGGGGTGG TTTCGCGCTG CTCCTCTTCC CGACTGGCCA TTTCCTTCTC 
AGCCCCCACC AAAGC6CGAC 6AGGA6AAGG GCTGACCGGT AAAGGAAGAG 
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22701 
22751 
22801 
22851 
22901 
22951 
23001 
23051 
23101 
23151 
23201 
23251 
23301 
23351 
23401 
23451 
23501 
23551 
23601 



GATATCCGTC TTTTTCTAGT ACCTCAQTCA GCTCTTCTTC CTGTCGGATT 

CCGCCCCCTC TGAGTTCGCC ACCACCGCCT CCACCGATGC CGCCAACGCG 
GGCGGG6GAG ACTCAAGCGG TGGTGGCGGA GGTGGCTACG GC6GTT6CGC 

CCTACCACCT TCCCCGTCGA GGCACCCCCG CTTGAGGAGG AGGAAGTGAT 
GGATGGTGGA AGGGGCAGCT CCGTGGGGGC GAACTCCTCC TCCTTCACTA 

TATCGAGCAG GACCCAGGTT TTGTAAGCGA AGACGACGAG GACCGCTCAG 
ATAGCTCGTC CTGCGTCCAA AACATTCCCT TCTGCTGCTC CTGGCGAGTC 

TACCAACAGA GGATAAAAAG CAAGACX:AGG ACAACGCAGA GGCAAACGAG 
ATGGTTGTCT CCTATTTTTC GTTCTGGTCC TGTTGCGTCT CCGTTTGCTC 

gaacaagtcg ggcgggggga cgaaaggcat ggcgactacc tagatgtggg 
cttgttcagc ccccccccct 6ctttccgta cccctgatgc atctacaccc 

agacgacgtg ctgttgaagc atctgcagcg ccagtgcgcc attatctgcg 
tctgctgcac gacaacttcg ta6acgtcgc ggtcacgcgg taata6acgc 

acgcgttgca agagcgcagc gatgtocccc tcgccatagc ggatgtcagc 

TGCGCAACGT TCTCGCGTC6 CTACACGGGG AGCGGTATCG CCTACAGTCG 

cttgcctacg aacgccacct attctcaccc cgcgtacccc ccaaacgcca 

GAACGGATGC TTGCGGT66A TAAGAGTGGC GCGCATGGGG OQTTTGCGGT 

AGAAAACGGC ACATGCGAGC CrCAACCCGCG CCTCAACTTC TACCCCGTAT 
TCTTTTGCCG TGTACGCTCG GGTTGGGCGC GGAGTTGAAG ATGGGGCATA 

TTGCCGTGCC AGAGGTGCTT GCCACCTATC ACATCTTTTT CCAAAACTGC 
AACGGCACGG TCTCCACGAA CGGTGGATAG TGTAGAAAAA GGTTTTGACG 

AAGATACCCC TATCCTGCCG TGCCAACCGC AGCCQAGCGG ACAAGCAGCT 
TTCTATGGGG ATAGGACGGC ACGGTTGGCG TCGGCTCGCC TGTTCGTCGA 

GGCCTTGCGG CTiGGGCGCTG TCATACCTCA TATCGCCTCG CTCAACGAAG 
CCGGAACGCC GTCCCGCGAC AGTATGGACr ATAGCGGAGC GAGTTGCTTC 

TGCCAAAAAT CTTTGAGGGT CTTGGACGCG ACGAGAAGCG CGCGGCAAAC 
ACGGTTTTTA GAAACTCCCA GAACCTGC6C TGCTCTTCGC GCGCCGTTTG 

GCTCTCCAAC AGGAAAACAG CGAAAATGAA ACTCACTCTG GAGTGTTGGT 
CGAGACGTTG TCCTTTTGTC GCTTTTACTT TCAGTGAGAC CTCACAACCA 

GGAACTCGAG GGTGACAAC6 CGC6CCTAGC CGTACTAAAA CGCAGCA7CG 
CCTTGAGCTC CCACTGTTGC G0GCGGATC6 6CATGATTTT OCGTCGTAOC 

AQQTCACCCA CTTTGCCTAC CCGGCACTTA ACCTACCCCC CAAGCTCATC 
TCCAGTGGGT GAAACGGATG GGCCGTGAAT TGGATGGGGG GTTCCAGTAC 

AGCACAGTCA TGAGTGA6CT GATCGTGCGC CGTGC6CAGC CCCTGGAGAG 
TCGTGTCA6T ACTCACTCGA COAGCACGCG GCACGC6TCG GGGACCTCTC 

6GATGCAAAT TTGCAAGAAC AAACAGAGGA GGGCCTACCC GCAGTTGGCG 
CCTACGTTTA AACCTTCTTG TTTGTCTCCT CCCGGATGGG CGTCAACCGC 
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23651 ACGA6CAGCT A^PCGCTGG CTTCJIAACGC GCGAGCCT6C CGACTT^G 
TGCTCGTCGA TCGCGCGACC GAAGTTTOCG CGCTCGGACG GCTGAACCTC 

237 OX GAGCGJ^GCA AACTAATGAT GGCCGCAGTG CTCGTTACCG TG6AGCTTGA 
CTCGCT6CGT TTGATTACTA CCGGC6TCAC GAGCAATGGC ACCTCGAACT 

23*751 GTGCATCCAC CGCTTCTTTG CTGACCCGGA GATCCAGCGC AAGCTAGAGG 
CACGTACGTC GCCAAGAAAC 6ACTGGQCCT CTACGTCGCG TTCGATCTCC 

23801 AAACATTGCA CTACACCTTT CGACA6GGCT ACGTACGCCA GGCCTGCAAG 
TTTGTAACCT 6AT6TQGAAA GCTGTCC3CGA TCCATGCGGT CCGGACGTTC 

23851 ATCTCCAACG TGGAGCTCTG CAACCTGGTC TCCTACCTTG GAATTTTGCA 
TAGAGGTTGC ACCTCGAQAC GTTGGACrAG AGGATGGAAC CTTAAAACGT 

23901 CGAAAACCGC CTTGGGCAAA ACGTGCTTCA TTCCACGCTC AAGGGCGAGG 
GCTTTTGGCG GAACCCGTTT TOCACGAACT AAGGTGCGA6 TTCCCGCTCC 

23951 CGCGCCGCGA CTACGTCCGC GACT6CGTTT ACTTATTTCT ATGCTACACC 
GCGCGGCGCT GATGCAGGCG CTGACGCAAA TGAATAAAGA TACGATGTGG 

24001 TGGCAGACG6 CCATGGGCGT TTGGCAGCAG TGCTTGGAG6 AGTGCAACCT 
ACCGTCTGCC G6TACCCGCA AACCGTCGTC ACGAACCTCC TCACGTTGGA 

24051 CAAGGAGCTG CAGAAACTGC TAAACCAAAA CTTGAAGGAC CTATGGACGG 
6TTCCTCQAC 6TCTTTGACG ATTTCGTTTT GAACTTCCTG GATACCTGCC 

24101 CCTTCAACGA GCGCTCCGTG GCCGCGCACC TGGCGGACAT CATTTTCCCC 
GGAAGTTGCT CGOGAGGCAC CGGCGCGTGG ACCGCCTGTA GTAAAAGGGG 

24151 GAACGCCTGC TTAAAACCCT GCAACAGGGT CTGCCAGACT TCACCAGTCA 
CTTGCGGACG AAOTTrGGGA CGTTGTCCCA GACGGTCTGA AGTGGTCAGT 

24201 AAGCATGTTG CAGAACTTTA GGAACTTTAT CCTAGAGCGC TCAGGAATCT 
TTCGTACAAC GTCTTGAAAT CCTTGAAATA GGATCTCGCG AGTCCTTAGA 

24251 TGCCCGCCAC CTGCTGTGCA CTTCCTAGCG ACTTTGTGCC CATTAAGTAC 
ACGGGCGGTG GACGACACGT GAAGGATCGC TGAAACACGG GTAATTCATG 

24301 CGC6AATGCC CTCCGCCGCT TTGGGGCCAC TGCTACCTTC TGCAGCTAGC 
GCGCTTACGG GAGGCGGCGA AACCCCGGTG ACGATGGAAG ACGTCGATCG 

24351 CAACTACCTT GCCTACCACT CTGACATAAT GGAAGACGTG AGCGGTGACG 
GTTGATGGAA CGGATGGT6A GACTGTATTA CCTTCTGCAC TCGCCACTGC 

24401 GTCTACTGGA GTGTCACTGT CGCTGCAACC TATGCACCCC GCACCGCTCC 
CAGATGACCT CACAGTGACA GCGACG7IGG ATACGTGGGG CGTGGCGAGG 

24451 CTGGTTTGCA ATTCGCAGCT GCTTAACGAA AGTCAAATTA TCGGTACCTT 
GACCAAACGT TAAGC6TCGA CGAATTGCTT TCAGTTTAAT AGCCATGGAA 

245C1 TGAGCTGCAG GGTCCCTCGC CTGACGAWl GTCCGCGGCT CCGGGGTTGA . 
ACTCGACGTC CCASGGAGCG GACTGCnTT CAGGCGCCGA GGCCCCAACT 

24551 AACTCACTCC GGG3CTGTGG ACGTCGGCTT ACCTTCGCAA ATTTGTACCT 
TTGAGTGAGG CCCCGACACC TGCAGCCGAA TGGAAGCGTT TAAACATGGA 
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24601 
24651 
24701 
24751 
24801 
24851 
24901 
24951 
25001 
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25151 
25201 
25251 
25301 
25351 
25401 
25451 
25501 



GAGCSACTACC A^SI^CCACGA GATTAC3GTTC TACGAAGACC AATCCClSl^C 
CTCCTGATGG TGCGGGTGCT CTAATCCAA6 ATGCTTCTGG TTAGGGCGGG 

« 

GCCTAATGCG GAGCTTACCG CCTGCGTCAT TACCCAGGGC CACATTCTTG 
CGGA7TACGC CTCGAAT6GC GGACGCAGTA ATGGGTCCCG GTGTAAGAAC 

GCCAATTGCA AGCCATCAAC AAAGCCCGCC AAGAGTTTCT GCTACGAAAG 
CGGTTAACGT TCGGTAGTTG TTTCGQGCGG TTCTCAAAGA CGATGCTTTC 

GGACGGGGGG TTTACrTGGA CCCCCAGTCC GGC6AGGA6C TCAACCCAAT 
CCTGCCCCCC AAATGAACCT GGGGGTCAGG CCGCTCCTC6 AGTTGGGTTA 

CCCCCCGCCG CCGCAGCCCT ATCAGCAGCA GCCGCGGGCC CTTGCTTCCC 
GGG6GGC6GC GGCGTCGGGA TAGTCGTCGT CGGCGCCC66 GAAC6AAGG6 

AGGATGGCAC CCAAAftAGAA GCTGCAGCTG CCGCCGCCAC CCACGGACGA 
TCCTACCGTiS GGTTTrTCTT CGACGTCGAC OOCGCCOGTC GGTGCCTGCT 

GGAGGAATAC TGGGACAGTC AGGCA6AGGA GGTTTTGGAC GAGGAGGAGG 
CCTCCTTATG ACCCTGTCAG TCCGTCTCCT CCAAAACCTG CTCCTCCTCC 

AGGACATGA7 GGAAGACTGG GAGAGCCTAG ACGAGGAAGC TTCCGAGGTC 
TCCTGTACTA CCTTCTGACC CTCTCGGATC TGCTCCTTCG AAGGCTCCAG 

GAAGAGGTGT CAGACGAAAC ACCGTCACCC TCGGTCGCAT 7CCCCTCGCC 
CTTCTCCACA GTCTGCTTTG TGGCAGTG6G AGCCAGC6TA AG3GGAGCGG 

GGCGCCCCAG AAATCGGCAA CCGGTTCCAG CATGGCTACA ACCTCCGCTC 
CCGCGGGGTC TTTAGCCGTT GGCCAAGGTC GTACCGATGT TGGAGGCGAG 

CTCAGGCGCC GOCGGCACTG CCC6TTCGCC GACCCAACCG TA3ATGGGAC 
GAGTCCGCGG CG6CCGTGAC GGGCAAGCGG CTGGGTTGGC ATCTACCCTG 

ACCACTGGAA CCAGGGCCGG TAAGTCCAAG CAGCCGCCGC CGTTAGCCCA 
TGGTGACCTT GGTCCCGGCC ATTCAGGTTC GTCGGCGGCG GCAATCGGGT 

AGAGCAACAA CAGCGCCAAG GCTACCGCTC ATGGCGCGGG CACAAGAACG 
TCTCGTTGTT GTCCCCGTTC CGATGQCGAG TACCGCGCCC . GT3TTCTTGC 

CCATAGTTGC TTGCTTGCAA GACTGTGGG6 GCAACATCTC CTTCGCCCGC 
GGTATCAACG AACGAACGTT CTGACACCCC CGTTGTAGAG GAAGCGGGCG 

CGCTTTCTTC TCTACCATCA CGGCGTGOCC TTCCCCCGTA ACATCCTGCA 
GCGAAAGAAG AGAT6GTAGT GCCGCACC6G AAGGGGGCAT TGTAGGACGT 

TTACTACCGT CATCTCTACA 6CCCATACTG CACCGGCGGC AGCGGCAGCA 
AATGATGGCA GTAGAGATGT CGGGTATOAC GTGGCCGCCG TCGCCGTCGT 

ACAGCAGCGG CCACACAGAA GCAAAG6CGA CCGGATAGCA AGACTCTGAC 
TGTCGTCGCC GGTGTGTCTT CGTTTCCGCT GGCCTATCGT TCTOAGACTG 

AAAGCCCAAG AAA?CCACAG CGGCGGCAGC AGCAGGAGGA GGAGCGCTGC 
TTTCGGGTTC TTTAGGT6TC GCCGCCGTCO TCGTCCTCCT CCTCGCGACG 

GTCTGGCGCC CAACGAACCC GTATCGACCC GCGAGCTTAG AAACAGGATT 
CAGACCGCGG GTTGCTTGGG CATAGCTGGG CGCTCGAATC TTTGTCCTAA 
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25551 TTTCCCACTC tS^TGCTAT ATTTCAACAG AGCAGGGGCC AAGAACJIP&I 

AAAGGGTGAG ACATACGATA TAAAGTTGTC TCGTCCCCGG TrCTTGTTCT 

25601 GCTGAAAATA AAAAACAGGT CTCT6CGATC CCTCACCCGC AGCTGCCTGT 
CGACTTTTAT TTTTTGTCCA GAGACGCTAG GGAGTGGGCG TCGACGGACA 

25651 ATCACAAAAG CGftAGATCAG CTTCGGCGCA CGCTGGAAGA CGCGGAGGCT 
TAGTGTTTTC GCTTCTAGTC GAAGCCGCGT GCGACCTTCT GCGCCTCCGA 

25701 CTCTTCACTA AATACTGCGC GCTGACTCTr AAGGACTAGT TTCGCGCCCT 
- 6AGAAGTCAT TTATGACGC6 CGACTGAGAA TTCCTGATCA AA6CGCGGGA 

25751 TTCTCAAATT TAAGCGCGAA AACTACGTCA TCTCCAGCGG CCACACCCGG 
AAGAGTTTAA ATTC6CGCTT TTGATGCAGT AGAGGTCGCC GGTQT6GGCC 

25801 CGCCAGCACC TGTTGTCAGC GCCATTATGA GCAAGGAAAT TCCCACGCCC 
GCGGTCGTGG ACAACAGTCG CGGTAATACT CGTTCCTTTA AGGGTGCGGG 

25851 TACATGTGGA GTTACCAGCC ACAAATG6GA CTTGCGGCTG GAGCTGCCCA 
ATGTACACCT CAATGGTCGG TGTTTACCCT GAACGCCGAC CTCGACGGGT 

25901 AGACTACTCA ACCCGAATAA ACTACATGAG CGCGGGACCC CA-CIATGATAT 
TCTGATGAGT TGGGCTTATT TGATGTACTC GCGCCCTGGG 6T3TACTATA 

25951 CCCGGGTCAA CGGAATACGC GCCCACCGAA ACC6AATTCT CCTGGAACAG 
GGGCCCAGTT GCCTTATGCG CGGGTGGCTT TGGCTTAAGA GGACCTTGTC 

26001 GCGGCTATTA CCACCACACC TCGTAATAAC CTTAATCCCC 6TAGTTGGCC 
CGCCGATAAT GGTGGTGTGG AGCATTATTG GAATTAGGGG CATCAACCGG 

„ . 2 6,0.51. CGCTGCCCTG. GTGTACCAGG AAAGTCO^ TCCCACCACT GTGGTACTTC 
GCGACGGGAC CACATGGTCC TTTCAGGGCG AGGGTGGTGA CACCATGAAG 

26101 CCAGAGACGC CCAGGCCGAA GTTCAGA'TGA CTAACTCAGG GGCGCAGCTT 
GGTCTCTGCG GGTCCGGCTT CAAGTCTACT GATTGAGTCC CCGCGTCGAA 

26151 GCGGGCGGCT TTCGTCACAG GGTGCGGTCG CCCGGGCAGG GTATAACTCA 
CGCCCGCCGA AAGCAGTGTC CCACGCCAGC GGGCCCGTCC CATATTGAGT 

26201 CCTGACAATC AGAGGGCGAG GTATTCAGCT CAACGACGAG TCGGTGAGCT 
GGACTGTTAG TCTCCTGCTC CATAAGTCGA GTTGCTGCTC AGCCACTCGA 

26251 CCTCGCTTGG TCTCCGTCCG GACGGGACAT TTCAGATCGG CGGCGCCGGC 
G6A6CGAACC AGA6GCAGGC C7GCCCTGTA AAGTCTAGCC GCCGCGGCCG 

26301 CGCTCTTCAT TCACGCCTCG TCAGGCAATC CTAACTCTGC AGACCTCGTC 
GCGAGAAGTA AGT6CGGA6C AGTCCGTTAG GATTGAGACG TCTGGAGCAG 

26351 CTCTGAGCCG CGCTCTGGAG GCATTGGAAC TCTGCAATTT ATTGAGGAGT 
GAGACTCGGC 6C6AGACCTC CGTAACCTTG AGACGTTAAA TAACTCCTCA 

26401 TTGTGCCATC GGTCTACTTT AACCCCTTCT CGGGACCTCC CGGCCACTAT 
AACACGGTAG CCAiGATGAAA TTGGGGAASA GCCCTG6A66 GCC6GTGATA 

26451 CCGGATCAAT TTATTCCTAA CTTTGACGCG GTAAAGGACT CGGCGGACGG 
GGCCTAGTTA AATAAGGATT GAAACTGCGC CATTTCCTGA GCCGCCTGCC. 
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26501 CTACGACTGA A^FtAAGTG GAGAGGCAGA GCAACTGCGC .CTGAAA^pC 
GATGCTGACT XACAATTCAC CTCTCCGTCT CGTTGACGCG GACTTTGTGG 

■ 

26551 tggtccactg tcgccgccac aagtgctttg cccgcgactc cggtgagttt 
accaggtgac agcggcggtg ttcacgaaac GGGCX^CTGAG gccactcaaa 

26501 TGCTACTTTG AATTGCCCGA GGATCATATC GAGGGCCCGG CGCACGGCGT 
ACGATGAAAC TTAACGGGCT CCTAGTATAG CTCCCGGGCC GCX3TGCCGCA 

26651 CCGGCTTACC GCCCAGGGAG AGCTTGCCCG TAGCCTGATT CGGGAGTTTA 
GGCCGAATGG CGGGTCCCTC TCGAACGGQC ATCGGACTAA GCrCCTCAAAT 

26701 CCCA6CGCCC CCTGCTAGTT GAGCGG6ACA 6GGGACCCTG TGTTCTCACT 
G66TCGCGGG GGACGATCAA CTCGCCCTGT CCCC7GGGAC ACAAGAGTGA 

26751 GTGATTTGCA ACTGTCCTAA CCCTGGATTA CATCAAGATC TTTGTTGCCA 
CACTAAACGT TGACAGGATT GGGACCTAAT GTAGTTCTA6 AAACAACGGT 

26B01 TCTCTGTGCT GAGTATAATA AATACAGAAA TTAAAATATA CTGGGGCTCC. 

AGAGACACGA CTCATATTAT TTATGTCTTT AATTTTATAT GACCCCGAGG 

2685.1 TATCGCCATC CTGTAAACGC CACCGTCTTC ACCCGCCCAA GCAAACCAAG 
ATAGCGGTAG GACATTTGCG GTGGCAGAAG TGGGCGGGTT CGTTTGGTTC 

26901 GCGAACCTTA CCT3GTACTT TTAACATCTC TCCCTCTGTG ATTTACAACA 
CGCTTGGAAT GGACCATGAA AATTGTAGAG AGGGAGACAC TAAATGTTGT 

26951 GrrrCAACCC AGACGGAGTG AGTCTACGAG AGAACCTCTC CGAGCTCAGC 
CAAAGTTGGG TCT3CCTCAC TCAGATGCTC TCTTGGAGAG 3CTCGAGTCG 

27001 TACTCCATCA GAAAAAACAC CACCCTCCTT ACCTGCCGGG AACGTACGAG 
ATGAGGTAGT CTTTTTTGTG GTGGGAG3AA TGGACGGCCC XTGCATGCTC 

27051 TGCGTCACCG CCCGCTGCAC CACACCTACC GCCTGACCGT AAACCAGACT 
ACGCAGTGGC CGGCGACGTG GTGTGGATGG CGGACTGGCA TTTGGTCTGA 

27101 TTTTCCGGAC AGACCTCAAT AACTCTGTTT ACCAGAACAG GAGGTGAGCT 
AAAAGGCCTG TCTGGAGTTA TTGAGACAAA TGGTCTTGTC CTCCACTCGA 

27151 TAGAAAACCC TTAGGGTATT AGGCCAAAGG CQCAGCTACT GTCGGGTTTA 
ATCTTTTGGG AATCCCATAA TCCGGTTTCC GCGTCGATGA CACCCCAAAT 

27201 TGAACAATTC AAGCAACTCT ACGGGCTATT CTAATTCAGG TTTCTCTAGA 
ACrTGTTAAG TTCGTTGAGA TGCCCGATAA GATTAAGTCC AAAGAGATCT 

27251 ATCGG6GTTG GGGTTATTCT CTGTCTTGT6 ATTCTCTTTA TTCTTATACT 
TAGCCCCAAC CCCAATAAGA GACAGAACAC TAAGAGAAAT AAGAATATGA 

27301 AACGCTTCrc TGCCTAAGGC TCGCCGCCTG CTOTGTGCAC ATTTOCATTT 
TTGCGAAGAG ACGGATTCC6 AGCGGCGGAC GACACACGTG TAAACGTAAA 

27351 ATTGTCAGCT TTTTAAACGC TGGGGTCGCC ACCCAAGAT6 ATTAGGTACA 
TAACAGTCGA AAAATTTCCG ACCCCAGCGG TGGGTTCTAC TAATCCATGT 

274 01 TAATCCTAC6 TTTACTCACC CTTGCGTCAG CCCACGGTAC CACCCAAAAG 
ATTAGGATCC AAATGAGTG6 GAACGCAGTC GGGTGCCATG GTG6CTTTTC 
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27451 GTOGATTTTA' A^P^CCAGC CTGTA\TGTT ACATTCGCAG CTGAAGiB^ 
CACCTAAAAT TCCTCGGTCG GACATTACAA TGTAAGCGTC GACTTCGATT 

27501 TGAGTOCACC ACTCTTATAA AATGCACCAC AGAACAT^SAA AAGCTGCTTA 
ACTCACGT6G TGAGAATATT TTACGTGGTG TCTTGTACTT TTCGACGAAT 

27551 TTCGCCACAA AAACAAAATT GGCAAGTATG CTGTTTATGC TATTTGGCAG 
AAGCGGTGTT TTTGTOTTAA CCGTTCATAC GACAAATACG ATAAACCGTC 

27601 CCAGGTGACA CTACAGAGTA TAATGTTACA GTTTTCCAGG GTAAAAGTCA 
GGTCCACTGT GATGTCTCAT ATTACAATGT CAAAAGGTCC CATTTTCAGT 

27651 TAAAACTTTT ATGTATACTT TTCCATTTTA T6AAATGTGC GACATTACCA 
ATTTTGAAAA TACATATGAA AAG6TAAAAT ACTTTACACG CTGTAAT6GT 

27701 TGTACATGAG CAAACAGTAT AAGTTGTGGC CCCCACAAAA TTCTGTGCAA 
ACATGTACTC GTTTGTCATA TTCAACACCG GGGGT6TTTT AACACACCTT 

27751 AACACTGGCA CTTTCTGCTG CACTGCXATG CTAATTACAG T6CTCGCTTT 
TTGTGACCGT GAAAGACGAC GTGACGA7AC GAT7AATGTC ACGAGCGAAA 

27801 GGTCTGTACC CTACTCTATA TTAAATACAA AAGCAGACGC AGCTTTATTG 
CCAGACATGG GAT6AGATAT AATTTATGTT TTCGTCTGCG TCGAAATAAC 

27851 AGGAAAAGAA AATGCCTTAA TTTACTAAGT TACAAAGCTA ATGTCACCAC 
TCCTTTTCTT TTACGGAATT AAATGATTCA ATGTTTCGAT TACAGTGGTG 

27901 TAACTGCTTT ACTCGCTGCT TGCAAAACAA ATTCAAAAAG TTAGCATTAT 
ATTGACGAAA TGAGCGACGA ACGTTTTGTT TAAGTTTTTC AATCGTAATA 

27951 AATTAGAATA. GGATTTAAAC CCCCCQGTCA TTTCCTGCTC AATACCATTC 
TTAATCTTAT CCTAAATTTG GGGGGCTAGT AAAGGACGAG TTATGGTAAG 

28001 CCCTGAACVl TTGACTCTAT GTGGGATATG CTCCAGCGCT ACAACCTTGA 
GGGACTTGTT AACTGAGATA CACCCTATAC GAGGTCGCGA TGTTGGAACT 

28051 AGTCAGGCTT CCTGGATGTC AGCATCTGAC TTTGGCCA6C ACCTGTCCCG 
TCAGTCCGAA 6GACCTACAG TCGTAGACTG AAACCGGTCG TGSACAGGGC 

28101 CGGATTTGTT CCAGTCCAAC TACAGCGACC CACCCTAACA GA3ATGACCA 
GCCTAAACAA GGTCAGGTTG ATGTCGCTGG GTGGGATTGT CTCTACTGGT 

28151 ACACAACCAA CGCGGCCGCC GCTACCGGAC TTACATCTAC CACAAATACA 
TGTGTTGGTT GCGCCGGCGG CGATGGCCT6 AATGTAGATG GTGTTTATGT 

28201 CCCCAAGTTT CTGCCTTTGT CAATAACTGG GATAACTTGG GCATGTGGTG 
6GGGTTCAAA 6ACG6AAACA GTTATTGACC CTATTGAACC CGTACACCAC 

28251 GTTCTCCATA GCGCTTATGT TTGTATGCCT TATTATTATG TGSCTCATCT 
CAAGAGGTAT CGC6AATACA AACATACGGA ATAATAATAC ACCGAGTAGA 

28301 GCTGCCTAAA GCXXAAACGC GCCCGACCAC CCATCTATAG TCCCATCATT 
CGACGGATTT CGCGTTTGCG CGGGCliSGTG GGTAQATATC AGSGTAGTAA 

28351 GTGCTACACC CAAACAATGA TGGAATCCAT AGATTGGACG GACTCAAACA 
CACGATGTGG GTTTGTOACT ACCTTAGGTA TCTAACCTGC CTGACTTTGT 
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28401 CATGTTCTTT T^bTTACAG TATGATTAAA TGAGACATGA rTCCTC^mft 
GTACAAGAAA a3!RaTGTC ATACTAATTT ACTCTGTACT AAGGAGCTCA 

2B451 TTTTATATTA CTGACCCTTG TTGCGCTTTT TTGTGCGTGC TCCACATTGG 
AAAATATAAT GACTGGGAAC AACGCGAAAA AACACGCACG ACGTGTAACC 



28501 CTGCGGTTTC TCACATCGAA GTAGACTGCA TTCCAGCCTT CACAGTCTAT 
GACGCCAAAG AGTGTAGCTT CATCTGACGT AAGGTCGGAA GTGTCAGATA 

28551 TTGCTTTACG GATTTGTCAC CCTCACGCTC ATCTGCAGCC TCATCACTGT 
AACGAAATGC CTAAACAGTG GGAGTGCGAG TAGACGTCG6 AGTAGTGACA 

28601 GGTCATCGCC TTTATCCA6T GCATTGACT6 G6TCTGTGTG CGCTTTGCAT 
CCAGTAGCGG AAATAGGTCA CGTAACTGAC CCAGACACAC GCGAAACGTA 

28651 ATCTCAGACA CCATCCCCAG TACA6QGACA GGACTATAGC TCAGCTXCTT 
TAGAGTCTGT CGTAGGGGTC ATCTCOCTGT CCTGATATCG ACTCGAAGAA 

28701 AGAATTCTTT AATTATGAAA TTTACTGTCA CTnTCTGCT GATTATTTGC 
TCTTAAGAAA TTAATACTTT AAATGACACT GAAAAGACGA CTAATAAACG 

28751 ACCCTATCTC CGTTTTGTTC CCCGACCTCC AA6CCTCAAA 6ACATATATC 
TGGGATAGAC QCAAAACAAG GGGCTGGAGG TTCGGAGTTT CTGTATATAG 

28801 ATGCAGATTC ACTCCTATAT GGAATATTCC AAGTTQCTAC AATG AAAA AA 
TACGTCTAAG TGAGCATATA CCTTATAAGG TTCAACGATG TTACTTTTTT 

28851 GCGATCTTTC CGAAGCCTGG TTATATGCAA TCATCTCTGT TAT6GTGTTC 
CGCTAGAAAG GCTTCGGACC AATATAC6TT AGTAGA6ACA ATACCACAAG 

28901 TGCAGTACCA TCTrAGCCCT AGCTATATAT CCCTACCTTG ACATTGGCTG 
ACGTCATGGT. AGAATCGGGA TCGATATATA GGGATGGAAC TGTAACCGAC 

28951 GAACGCAATA GATGCCATGA ACCACCCAAC TTTCCCCGCG CCCGCTATOC 
CTTGCGTTAT CTACGGTACT TGGTGGGTTG AAAGGGGCGC GGGCGATACG 

29001 TTCCACTGCA ACA«i.GTTGTT GCCGGCGGCT TTGTCCCAGC CAATCAGCCT 
AAGGTGACGT TGTTCAACAA CGGCCGCCGA AACAGGGTCG GTTAGTCGGA 

29D51 CGCCCACCTT CTCCCACCCC CACTGAAATC AGCTACTTTA ATCTAACAGG 
GCGGGTGGAA GAG(3GTGCCG GTCACTTTAG TCGATGAAAT TAGATTGTCC 

29101 AGGAGATGAC TGACJ^CCCTA GATCTAGAAA TGGACGGAAT TATTACAGAG 
TCCTCTACTG ACTGTGGGAT CTAGATCTTT ACCTGCCTTA ATAATGTCTC 

29151 CAGCGCCTGC TAGAAAGACG CASGGCAGCG GCCGAGCAAC AGCGCATGAA 
GTCGCG6ACG ATCTTTCTGC GTCCCGTCGC CGGCTCGTTG TCGCGTACTT 

29201 TCAAGAGCTC CAAGACATGG TTAACTTCCA CCAGTGCAAA AGGGGTATCT 
AGTTCTCGAG GTTCTGTACC AATTGAACGT GGTCACGTTT TCCCCATAGA 

29251 TTTCTCTCGT AAAGCAGGCC AAAGTCACCT ACGACAGTAA TACCACCGGA 
AAACAGAGCA TTXC6TCCGG TTTCAGTGGA TGCTGOXyiTT ATGGTGGCCT 

29301 CACCGCCTTA GCTACAAGTT GCCAACCAAG CGTCA6AAAT TGGTGGTCAT 
GTCGCGGAAT CGATGTTCAA CGGTTGGTTC GCAGTCTTTA ACCACCA6TA 
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29331 GGT6GGAGAA tiJlfg^XATTA CCATAACTCA GCACTCGGTA GAA^lCC^pS 
CCACCCTCTT TTCGGG7MT GGTATTG&GT CGTGAGCCAT CTTT6GCTTC 

29401 GCTGCATTCA CTCACCTTGT CAA66ACCTG A6GATCTCTG CACCCTTATT 
C6ACGTAAGT GAGT6GAACA GTTCCT66AC TCCTAGAGAC GTGG6AATAA 

29451 AAGACCCTCT GCGGTCTCAA AGATCTTATT CCCTTTAACT AATAAAAAAA 
TTCTGGGACA C0CCA6AGTT TCTAGAATAA Q3GAAATTGA TTATTTTTTT 

29501 AATAATAAA6 CATCACTTAC 7TAAAATCAG TTAGCAAATT TCTGTCCAGT 
TTATTATTTC GTAGTGAAT6 AATTTTA6TC AATCGTTTAA ACaACAGGTCA 

29551 ttattcagca gcacctcctt gccctcctcc cagctctggt attgcagctt 
aataagtcgt cgtggaggaa cg66aggagg 6tc6agacca taacgtcgaa 

29601 cctcctggct gca&actttc tccacaatct aaatogaatg tcagtttcct 
ogaggaccga cgtttgaaag aggtgttafia tttaccttac agtcaaagga 

29651 cctgttcct6 tccatccgca cccactatct tcatgttgtt gcagatgaag 
ggacaa6gac a6gtaggcgt gogtgataga agtacaacaa cgtctacttc 

29701 cgc6caagac cgtctgaa6a taccttcaac cccgtgtatc catatgacac 
gcgcgttctg gcagacttct atgc3aagttg gggcacatag gta'?actgtg 

29751 cgaaaccggt cctccaactg tgccttttct tactcctccc ttt6tatccc 
cctttggcca ggaggttgac acggaaaa6a at6aggaggg aaacataggg 

29801 ccaatgggtt tcaagagagt ccccctggog tactctcttt gcgcctatcc 
ggttacccaa ag7tctctca ggg6gacc0c atgagagaaa cgc6gatagg 

29851 gaacctctag ttacctccaa tggcatgctt gcgctcaaaa tgggcaacgg 

cttg'gagatc aAtggaggt t accgtAcgaa cgcgagtttt Ac'ccgttgcc 

29901 cctctctctg gacgaggccg gcaaccttac ctcccaaaat gtaaccactg 

GGAGAGAGAC CTGCTCCGGC CGTTGGAATX3.GAGGGTTTTA CATTGGTGAC 

29951 TGAGCCCACC TCTCAAAAAA ACCAAGTCAA ACATAAACCT GGAAATATCT 
ACTCGGGTGG AC5AGTTTTTT TGGTTCAGTT TGTATTTGGA CCTTTATAGA 

30001 GCACCCCTCA CAGTTACCTC AGAAGCCCTA ACTGTGGCTG CCGCCGCACC 
CGTGGGGAGT GTCAATGGAG TCTTCGGGAT TGACACCGAC GGCGGCGTGG 

30051 TCTAATGGTC GCGGGCAACA CACTCACCAT GCAATCACAG GCCCCGCTAA 
AGATTACCA6 CGCCCGTTGT GTGAGTGGTA CGTTAGTGTC CGGGGCGATT 

30101 CCGTGCACGA CTCCAAACTT AGCATTGCCA CCCAAGGACC (XTCACAGTG 
GGCACGTGCT 6AGGTTT6AA TC6TAACGGT GGGTTCCTGG GGAGTGTCAC 

30151 TCAGAAGGAA AGCTAGCCCT CCAAACATCA GCCCCCC1CA CCACCACCGA 
AGTCTTCCTT TCGATC6GGA CGTTTGTAGT CCQGGGGAGT GGTGGTCGCT 

30201 TAGCAGTACC CTTACTATCA CTGCCTCACC CCCTCTAACT ACTGCCACTG 
ATCGTCATGG 6AATGATAGT GACGGAGTGG GGGAGATTGA TGACGGTGAC 

30251 GTAGCTTGGG CATTGACTTG AAAGAGCCCA TTTATACACA AAATGGAAAA 
CATCGAACCC GTAACTGAAC TTTCTCGGGT AAATATGTGT TTTACCTTTT 
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30301 CTACGACTAA ji|Jk:GGGGC TCCTTTGCAT GTAACAGACG^^^CC^AgMJC 
GATCCTGATT" raBoCCCCG AGGAAACGTA CATTGTCT6C TGGATT^G 

30351 TTTGACCGTA GCAACTGGTC CAGGTGTGAC TATTAATAAT ACTTCCIITGC 
AAACTGfGCAT CGTTGACCAG GTCCACaCTG ATAATTATTA TGAAGGAACG 

30401 AAACTAAAGT TACTGCSAGCC TTGGGTTTTG ATTCACAAGG CAATATGCAA 
TTTGATTTCA ATGACCTCGG AACCCAAAAC TAAGTGTTCC GTTATACGTT 

30451 CTTAATGTAG CAGGA6GACT AAGGAT7GAT TCTCAAAACA GACGCCTTAT 
GAATTACATC GTCCTCCTGA rPCCTAACTA AGAGTTTTGT CTGCGGAATA 

30501 ACTTGATGTT AGTTATCCGT TTGATGCTCA AAACCAACTA AATCTAAGAC 
' TGIJ^CTACAA TCAATM36CA AACTAOGAGT TTTGGTTGAT TTAGATTCTG 

30551 TAGGACAGGG CCCTCTTTTT ATAAACTCAG CCCACAACTT GGATATTAAC 
ATCCTGTCCC GGGASAAAAA ZArTTGAGTC GGGTGTTGAA CCTATAATTG 

30601 TACAACAAAG GCCTTTACTT GTTTACA6CT TCAAACAATT CCAAAAAGCT 
ATGTTGTTTC CGGAAATGA;^ CAAATG7CGA AGTTTGTTAA GGTTTTTCGA 

30651 TGAGGTTAAC CTAAGCACT6 CCAAGGGGTT GATGTTTGAC GCTACAGCCA 
ACTCCAATTG GATTCGTGAC GGTTCCCCAA CTACAAACTC CGATGTCGGT 

3 07 01 TAGCCATTAA TCCAGGAGAT GGGCTTGAAT TTGGTTCACC TAATGCACCA 
ATCGGTAATT ACGTCCTCTA CCCGAftCTTA AACCAAGTGG ATTACGTGGT 

30751 AACACAAATC CCCTCAAAAC AAAAATTGGC CATGGCCTAG AATTTGATTC 
TTGTGTTTAG GGGAGTTTTG TTTTTAACCG GTACCGGATC TTAAACTAAG 

30BD1 AAACAAGGCT ATGGTTCCTA AACTAGGAAC TGGCCTTAGT TTTGACAGCA 
TTTGTTCCGA TACCAAGGAT TTGATCCTTG ACCQGAATCA AAACTGTCGT 

30 851 CAGGTGCCAT TACAGTA66A AACAAAAATA ATGATAAGCT AACTTTGTGG 
6TCCACGGTA ATGTCATCCT TTGTTTT7AT TACTATTCGA TTGAAACACC 

30901 ACCACACCAG CTCCATCTCC TAACTCTAGA CTAAATGCAG AGAAAGATGC 
TGGTGTGGTC GAG6TAGAG6 ATTGACATCT GATTTACGTC TCTTTCTACG 

30951 TAAACTCACT TTGGTCTTAA CAAAATGTGG CAGTCAAATA CTTOCTACAG 
ATTTGAGTGA AACCAGAATT CTTTTACACC CTCAGTTTAT GAACGATGTC 

31001 TTTCAGTTTT GGCTGTTAAA GGCAGTrTGG CTCCAATATC TGGAACAGTT 
AAAGTCAAAA CCGACAATTT CCGTCAAACC GAGGTTATAG ACCTTGTCAA 

31051 CAAAGTGCTC ATCTTATTAT AAGATTTGAC GAAAATGGAG TGCTACTAAA 
GTTTCACGA6 TAGAATAATA TTCTAAACTG CTTTTACCTC ACGATGATTT 

31101 CAATTCCTTC CTGGACCCAG AATATTGGAA CTTTAGAAAT GGAGATCTTA 
GTTAAGGAAG GACCTGG6TC TTATAACCTT GAAATCTTTA CCTCTAGAAT 

31151 CTGAAGQCAC AGCCTATACA AACGCTGTTG GATTTATGCC TAACCTATCA 
GACTTCCGTG TCGGATATGT TTGCGACAAC CTAAATACGG AOTCGATAGT 

31201 GCTTATCCAa' AATCTCACGG TAAAACTGCC AAAAGTAACA TTOTCAGTCA 
CGAATAGGTT TTAGAGTGCC ATTTTGACGG TTTTCATTGT AACAGTCAGT 
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31251 AGTTTACTTA ABICGAGACA AAACTAAACC TGTAACACTA ACCATtSPJc 
TCAAATGAAT TT13CCTCTGT TTTGATTTGG ACATTGTGAT TGGTAATGTG 

313 01 TAAACGGTAC ACAGGAAACA GGAGACACAA CTCCAAGTGC ATACTCTATG 
ATTTGCCATG TGTCCTTTGT CCTCTGTGTT GAGGTTCACG TATGAGATAC 

31351 TCATTTTCAT GGGACTGGTC TGGCCACAAC TACATTAATG AAATATTTGC 
AGTAAAA6TA CCCTGACCAG ACCGGTGTTG ATGTAATTAC TTTATAAACG 

31401 CACATCCrCT TACACTTTTT CATACATTGC CCAAGAATAA AGAATCGrTT 
GTGTAGGAGA ATGTGAAAAA GTAT6TAACG GGTTCTTATT TCTTAGCAAA 

31451 GTGTTAT6TT TCAACGTGTT TATTTTTCAA TTGCAGAAAA TTTCAAGTCA 
CACAATACAA AGT7GCACAA ATAAAAAGTT AACGTCTTTT AAAGTTCAGT 

31501 TOTTTCATTC AGTAGTATAG CCCCACCACC ACATAGCTTA TACAGATCAC 
AAAAAGTAAG TCA?CATATC GG6GTGG7GG TGTATCGAAT ATGTCTAGTG 

31551 CGTACCTTAA TCAAACTCAC AGAACCCTAG TATTCAACCT GCCACCTCCC 
6CATGGAATT AGTTTGAQTG TCTTGGGATC ATAAGTTGGA CGGTGGAGGG 

31601 TCCCAACACA CAGAGTACAC AGTCCTTTCT CCCCGGCrGG CCTTAAAAAG 
AGGGTTGTGT GTCTCATGTG TCAGGAAAGA GGGGCCGACC GGAATTTTTC 

31-651 CATCATATCA TGGGTAACAG ACATATTCTT AGGTGTTATA TTCCACACGG 
GTAGTATAGT ACCCATTGTC TGTATAAGAA TCCACAATAT AAGGTGTGCC 

31701 TTTCCTGTCG AGCCAAACGC TCATCAGTGA TATTAATAAA CTCCCCGGGC 
AAAGGACAGC TCGGnTGCG AGTAGTCACT ATAATTATTT GAGGGGCCCG 

31751 AGCTCACTTA AGTTCAT6TC- GCTGTCCAGC TGCTGAGCCA: CAGGCTGC.TG _„ 
TCGAGTGAAT TCAAGTACAG CGACAGGTCG ACGACTC6GT GTCCGACGAC 

31801 TCCAACTTGC GGTTGCTTAA CGGGCGGCGA AGGAGAAGTC CACGCCTACA 
AG6TTGAACG CCAACGAATT GCCCGCCGCT TCCTCTTCAG GTGCGGAT6T 

31851 TGGGGGTAGA GTCATAATCG TGCATCA6GA TAGGGCGGTG GTGCTGCAGC 
ACCCCCATCT CAGTATTAGC ACGTACTCCT ATCCCGCCAC CACGACCTCG 

31901 AGCGCGCGAA TAAACTGCTG CCGCCGCCGC TCCGTCCTGC AGGAATACAA 
TCGCGCGCTT ATTTGACGAC 6GCGGC6GCG AGGCA6GACG TCCTTAT6TT 

31951 CATG6CAGTG GTCTCCTCAG CGATGATTCG CACCGCCCGC AGCATAAGGC 
GTACCGTCAC CAGAGGAGTC GCTACTAAGC GTGGCGGGCG TCGTATTCCG 

32001 GCCTTGTCCT CCG06CACAG CAGCGCACCC T6ATCTCACT TAAATCAGCA 
CGGAACAGGA GGCCXTGTGTC 6TCGC6TGGG ACTAGAGTGA A7TTAGTCGT 

32051 CAGTAACTGC AGCACAGCAC CACAATATTG TTCAAAATCC CACAGTGCAA 
GTCATTGACG TCGT6TC6TG GTGTTATAAC AAGTTTTAG6 GTGTCACGTT 

32101 GGCGCTGTAT CCAAAGCTCA TGGCG6G6AC CACAGAACCC ACGTGGCCAT 
CCGCGACATA GGTTTCGAGT. ACCGCCCCTG GTGTCTTGG6 TGCACCGGTA 

32151 CATACCACAA GCGCAGGTAG ATTAA6TG6C aCCCCTCAT AAACACGCTG 
GTATGGTGTT CGCGTCCATC TAATTCACCG CTGGGGAGTA rTTTGTGCGAC 
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32201 GACATAAACA ^^CTCTTT TGGCATGrTG TAATTCACCA CCTCCO^^A 
CTGTATTTGT. AATGGAGAAA ACCGTACAAC ATTAAGTGGT GGAGGGCCAT 

32251 CCATATAAAC CTCTGATTAA ACATCGCGCC ATCCACCACC ATCCTAAACC 
GGTATATTTG GAGACTAATT TGTACCGCGG TAGGTGGTGG TAGGATTTGG 

32301 AGCTOGCCAA AACCTGCCCG CCGGCTATAC ACTGCAGGGA ACCGGGACTG 
TCGACCGGTT TTGGACGGGC GGCCGATATG TGACGTCCCT TGGCCCTGAC 

32351 GAACAATGAC AGTGGA6AGC CCAGGACTCG TAACCATGGA TCATCATGCT 
CTTGTTACTG TCACCTCTCG GGTCCTGAGC ATTGGTACCT AGTAGTACGA 

32401 CGTCATGATA TCAATGTTGG CACAACACAG GCACACGTGC ATACACTTCC 
GCAGTACTAT AGTTACAACC GTGTTGTGTC C6TGTGCACG TATGTGAAGG 

32451 TCAGCATTAC AAGCTCCTCC CGCGTTAGAA CCATATCCCA GGGAACAACC 
AGTCCTAATG TTCGAGGAGG GCGCAATCTT GGTATAGGGT CCCTTGTTGG 

32501 CATTCCTGAA TCAGCGTAAA TCCCACACTG CAGGGAAGAC CTCGCACGTA 
GTAAGGACTT ACTCGCATTT AGGGTGTGAC GTCCCTTCTG GAGCGTGCAT 

32551 ACTCACGTTG TGCATTGTCA AAGTGTTACA TTCGGGCAGC AGCGGATGAT 
TGAGTGCAAC ACGTAACAGT TTCACAATGT AAGCCCGTCG TCGCCTACTA 

32601 CCTCCAGTAT GGTAGCGCGG GTTTCTGTCT CAAAAGGAGG TAGACGATCC 
GGAGGTCATA CCATCGCGCC CAAAGACAGA GTTTTCCTCC ATCTGCTAGG 

32651 CTACTGTACG GAGTGCGCCG AGACAACCGA GATCGTGTTG 3TCGTAGTGT 
GATGACATGC CTCACGCGGC TCTGTT63CT CTAGCACAAC CAGCATCACA 

32701 CATGCCAAAT GGAACGCCGG ACGTAGTCAT ATTTCCTGAA QCAAAACCAG 
GTACGGTTTA CCTTGCGGCC TGCATCAGTA TAAAGGACTT CXJTTTTGGTC 

32751 GTGCGGGCGr GACAAACAGA TCTGCGTCTC CGGTCTCGCC GCTTAGATC6 
CACGCCCGCA CTGTTTGTCT AGACGCA3A6 GCCAGAGCGG CGAATCTAGC 

32801 CTCTGTGTAG TAGTTGTAGT ATATCCACTC TCTCAAAGCA TCCAGGCGCC 
GAGACACATC ATCAACATCA TATAGGTGAG AGAGTTTCGT AGGTCCGCGG 

3 2 851 CCCTGGCTTC GGGTTCTATG TAAACTCCTT CATGCGCCGC TGCCCTGATA 
GGGACCGAAG CCCAAGATAC ATTTGAGGAA GTACGCGGCG AC66GACTAT 

32901 ACATCCACCA CCGCAGAATA AGCCACACCC AGCCAACCTA CACATTCGTT 
TCTAGGTGGT GGCGTCTTAT TCGG7GTGGG 7CGGTTGGAT STGTAA6CAA 

32951 CT6CGAGTCA CACACG6GAG GAGCGGGAAG A6CTGGAAGA ACCATGTTTT 
GACGCTCAGT GTGTGCCCTC CTCGCCCTTC TCGACCTTCT TG6TACAAAA 

33001 TTTTT T TATT CCAAAAGATT ATCCAJkAACC TCAAAATQAA QATCTATTAA 
AAAAAAATAA GGTTTTCTAA TAGGTTTXGG AGTTTTACTT CTAGATAATT 

33051 GTGAACGCGC TCCCCTCCOG T6GCGTGGTC AAACTCTACA GCCAAAGAAC 
CACTTGCGCG AGGQ6AG6CC ACCGCACCA6 TTTGAGATGT CGGTTTCTT6' 

33101 AGATAATGGC ATTT6TAAGA TGTTGCACAA TGGCTTCCAA AAGGCAAACG 
TCTATTACCQ TAAACATTCT ACAACGTGTT ACCGAA6GTT TTCCGTTTGC 
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3315X 
33201 
33251 
33301 
33351 
33401 
33451 
33501 
33551 
33601 
33651 
33701 
33751 
33801 
33B51 
33901 



GCCCTCACGT ( fvaA grGGAC GTAAAGGCTA AACCCTTCffltS ^TCSUSS^Sr 
CGGGAGTGCA ^SMtCACCTG CATTTCCGAT TTGGGAAGTC Ca«kOTr!BRG 

■ 

CTCTATAAAC ATTCCAGCAC CTTCAACCAT GCCCAAATAA TTCTCATCTC 
6AGATATTTG TAAGGTCGTG GAAGTTGGTA CGGGTTTATT AAGA6TA6AG 

GCCACCTTCT CAATATATCT CTAAGCAAAT CCCGAATATT AA3TCCGGCC 
CGGTGGAAGA GTTATATAGA GATTCGTTTA GGGCTTATAA TTCAGGCCGG 

ATTGTAAAAA TCTGCTCCAG AGCGCCCTCC ACCTTCAGCC TCAAGCAGCG 
TAACATTTTT AGAC6AGGTC TCGCGGGAGG TGGAACTCGG AGTTCGTCGC 

AATCATGATT QCAAAAATTC AGGTTCCTCA CAGACCTGTA TAAGATTCAA 
TTAGTACTAA CGTTTTTAAG TCCAAGGAGT GTCTGGACAT ATTCTAAGTT 

AAGCGGAACA TTAACAAAAA TACCGCGATC CCGTAGGTCC CTTCGCAGGG 
TTCGCCTTGT AATTGTTTTT ATGGCGCTAG GGCATCCAGG GAAGCGTCCC 

CCAGCTGAAC ATAATCGTGC AGGTCTGCAC 6GACCAGCGC GGCCACTTCC 
G6TCGACTTG TATTAGCACG TCCAGACGTG CCTGGTCGCG CCGGTGAAGG 

CCGCXTAGGAA CCATGACAAA AGAACCCACA CT6ATTATGA CACGCATACT 
GGCGGTCCTT GGTACTGTTT TCTTGGGTGT GACTAATACT GTGCGTATGA 

CGGAGCTATG CTAACCAGCG TAGCCCCGAT GTAAGCTTGT TGCATGGGCG 
GCCTCGATAC GATTGGTCGC ATCGGGGCTA CATTCGAACA ACGTACCCGC 

GCGATATAAA ATGCAAGGTG CTGCTCAVlA AATCAGGCAA AGCCTCGCGC 
CGCTATATTT TACGTTCCAC GACGAGTTTT TTAGTCCGTT TCG6AGCGCG 

AAAAAAGAAA GCA'CATCGTA GTCATGCTCA TGCAGATAAA GGCA6GTAAG 
TTTTTTCTTT CGT3TAGCAT ^CAGTACGAGT ACGTCTATTT CCGTCCATTC 

CTCCGGAACC ACCACAGAAA AAGACACCAT TTTTCTCTCA AACATGTCTG 
6AGGCCTTG6 TGGTGTCTTT TTCTGTG3TA AAAAGAGAGT TTGTACAGAC 

CGGGTTTCTG CATAAACACA AAATAAAATA ACAAAAAAAC ATTTAAACAT 
GCCCAAAGAC GTATTTGTGT TTTATTTTAT TGTTTTTTTG TAAATTTGTA 

TAGAAGCCTG TCTTACAACA GGAAAAACAA CCCTTATAAG CATAAGACGG 
ATCTTCGGAC AGAATGTTGT CCTTTTTGTT GGGAATATTC GTATTCTGCC 

ACTACGGCCA TGCCGGCGTG ACCGTAAAAA AACTGGTCAC CGTGATTAAA 
TGATGCCGGT ACQGCCGCAC T6GCATTTTT TTGACCAGTG GCACTAATTT 

AAGCACCACC GACAGCTCCT CGGTCATCTC CGGAGTCATA ATGTAAGACT 
TTC6TGGTGG CTGTCGAGGA GCCAGTACAG GCCTCAGTAT 3ACATTCT6A 

CGGTAAACAC ATCAG6TTGA TTCACATCGG TCAGTGCTAA AAAGCGACCG 
GCCATTTGTG TAGTCCAACT AA6TGTAGCC AGTCACGATT TTTCGCTGGC 

AAATAGCCCG 6GGGAATACA TACCCGCAGG CGTAGAGACA ACATTACAGC 
TTTATCGGGC CC0CTTAT6T ATGGGCGTCC GCATCTCTGT TGTAATGTCG 

CCCCATAGGA GGTATAACAA AATTAATAGG AGAGAAAAAC ACATAAACAC 
GG6GTATCCT CCATATTGTT TT7ATTATCC TCTCTTTTTG TCTATTTGTG 
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34101 CTGAAAAACCiCJ^TGCCTA GGCAAAATAG CACCCTCCge-^€TeC3i CgDaB l 
GACTTTTTOG GJ^BACGGAT CCGTTTTATC GTGGGAGGGC GAGGTC1BRT 

34151 ACATACAGCG CTTCCACAQC GGCAGCCATA ACAGTCAGCC TTACCAGTAA 
TGTATGTCGC GAAGGTGTCG CCGTCGGTAT TGTCAGTCGG AATSSGTCATT 

34201 AAAAGAAAAC CTATTAAAAA AACACCACTC GACACGGCAC CAGCTCAATC 
TTTTCTTTTG GATAATTTTT TTGTGGTGAG CTGTGCCGTG QTCGAGTTAG 

34251 A6TCACAGTG TAAAAAAGGG CCAAG7X3CAG AGCGAGTATA TATAGGACTA 
TCAGTGTCAC ATTTTTTCCC GGTTCACCTC TCGCTCATAT ATATCCTGAT 

34301 AAAAATGACG TAAC6GTTAA AGTCCACAAA AAACACCCAG AAAAC CGCAC 
TTTTTACTGC ATTGCCAATT TCAGGTGTTT TTTGTGGGTC TTTTGGCGTG 

34351 GCGAACCTAC GCCCAGAAAC GAAA6CCAAA AAACCCACAA CTTCCTCAAA 
CGCTTGGATG CGGGTCTTTG CTTTCQGTTT TTTGGCTGTT GAAGCA6TTT 

34401 TCGTCACTTC CGTTTTCCCA CGTTACGTCA CTTCCCATTT 7AAGAAAACT 
AGCAGTGAAG GCAAAAGGGT GCAATGCAGT 6AAGGGTAAA ATPCTTTTGA 

34451 ACAATTCCCA ACACATACAA GTTACTCCGC CCTAAAACCT AC3TCACCCG 
TGTTAAGGGT TGT6TATGTT CAATGA6GCG GGATTTTGGA TGCAGTGGGC 

4 

34501 CCCCGTTCCC ACGCCCCGCG CCACGTCACA AACTCCACCC CCTCATTATC 
GGGGCAAGGG TGCGGGGCGC GGTGCAGTGT TTGAGGTGGG GGAGTAATAG 



34551 ATATTGGCTT CAATCCAAAA TAAGGTATAT TATTGATGAT GTTAATTAAG 
TATAACC6AA GTTAGGTTTT ATTCCATATA ATAACTACTA CAATTAATTC 

34601 AATTCGGATC TGCGACGCGA GGCTGGATGG CCTTCCCCAT TATGATTCTT 
TTAAGCCTAG ACGCTGCGCT CCGACCTACC GGAAGGGGTA ATACTAAGAA 

34651 CTCGCTTCCG GCGGCJiTCGG GATGCCCSCG TTGCAGGCCA rGCTGTCCAG 
GAGCGAAGGC CGCCGTAGCC CTACGGGCGC AACGTCCGGT ACXSACAGGTC 

34701 GCAGGTAGAT GACGACCATC AGGGACAGCT TCAAGGCCAG CAAAAGGCCA 
CGTCCATCTA CTCCTGCTAG TCCCTGTCGA AGTTCCGGTC GTTTTCCGGT 

34751 GGAACCGTAA AAAGGCCGCG TTGCTGGCGT TTTTCCATAG GCTCCGCCCC 
CCTTGGCATT TTTCCGGCGC AACGACCGCA AAAAGGTATC CGAGGCGGGG 

34801 CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT 6GCGAAACCC 
GGACTGCTCG TAGTGTTTTT AGCTGC6AGT TCAGTCTCCA CCGCTTTGGG 

34B51 GACAGGACTA TAAAGATACC AGGCX3TTTCC CCCTGGAAGq TCCCTCGTGC 
CTGTCCTGAT ATTTCTATGG TCCGCAAAGG GGGACCOTCC AGGGAGCACG 

34901 GCTCTCCTGT TCCGACCCTG CCGCTTACCG GATACCTGTC CGCCTTTCTC 
CGAGAGSACA AGGCTGGGAC GGCGAATSGC CTATGGACAG GCGGAAAGAG 

34951 CCTTCG36AA GCGTOGCGC?r TTCTCATAGC TCAC6CTGTA GGTATCTCA6 
GGAAGCCCTT CGCACCGCGA AAGA6TATCG AGTOCGACAT CCATAGAGTC 

35001 TTCGGTGTA6 GTCGTTCGCT CCAAGCTGGG CTGTGTGCAC GAACCCCCCG 
AAGCCACATC CTlGCAAGCGA GGTTCGACXrC GACACACGTG CTTGGGGGGC 
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35051 TTCAGCCCGA CCGCTGCGCC TTATCCGGTA ACTATCGTCT TGAGTCCSaC 
AAGTCGGGC7 GGCGACGC6G AATAGGCCAT TGATAGCAGA ACTCA6GTTG 

35101 CCGGTAAGAC ACGACTTATC GCCACTGGCA GCAGCCACTG GTAACAGGAT 
GGCCATTCTG TGCTGAATAG CGGTGACCGT CGTCGGTGAC CATTGTCCTA 

35151 TAGCAGAGCG AGGTATGTAG GCGGTGCTAC AGAGTTCTTG AAGTGGTGGC 
ATCGTCTCGC TCCATACATC CGCCACGAT6 TCTCAAGAAC TTCACCACCG 

35201 CTAACTACGG CTACACTAGA A6GIACAGTAT TTGGTATCTG CGCTCTGCT6 
GATTGAT6CC GATGTGATCT TCCTGTCATA AACCATAGAC 6CGAGACGAC 

35251 AAGCCAGTTA CCTTCGC3AAA AAGAGTTGGT AGCTCT7GAT CCGGCAAACA 
TTCGGTCAAT GGAAGCCTTT TTCTCAACCA TCGAGAACTA GGCCGTTTGT . 

35301 AACCACCGCT GGTAGC6GTG GTTTTTTTGT TT6CAAGCAG CAGATTACGC 
TTGGTGGCGA CCATCGCCAC CAAAAAAACA AACGTTCGTC GTCTAATGCG 

35351 GCAGAAAAAA AGGATCTCAA 6AAGATCCTT T6ATCTTTTC TACGGGGTCT 
CGTCTTTTTT TCCTAGAGTT CTTCTAGGAA ACTAGAAAAG ATGCCCCAGA 

35401 GACGCTCAGT GGAACGAAAA CTCACGTTAA G6GATTTTGG TCATGAGATT 
CTGCGAGTCA CCTTGCTTTT GAGTGCAATT CCCTAAAACC AGTACTCTAA 

35451 ATCAAAAAGG ATCTTCACCT AGATCCTTTT AAATCAATCT AAAGTATATA 
TAGTTTTTCC TAGAAGTGGA TCTAGGAAAA TTTA6TTAGA TTTCATATAT 

35501 TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT GAGGCACCTA 
ACTCATTTGA ACCAGACTCT CAATCGTTAC 6AATTAGTCA CTCCGTGGAT 

35551 TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG ACTCCCCGTC 
AGAGXCGCTA GACAGATAAA GCAAGTAGCT ATCAACGGAC TGAGGGGCAG 

35601 GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCX: CCAGTGCTGC 
CACATCTATT GATGCTATGC CCTCCCGAAT GGTAGACCGG GGTCACGACG 

35651 AATGATACCG CGAGACCCAC GCTCACCGGC TCCAGATTTA TCAGCAATAA 
TTACTATGGC GCTCTGGGTG CGAGTGGCCG AGGTCTAAAT AGTCGTTATT 

35701 ACCAGCCAGC C6GAAGGGCC GAGCGCAGAA GTGGTCCTGC AACTTTATCC 
TG6TCGGTCG GCCTTCCCGG CTCGCGTCTT CACCAGGACG TTGAAATAGG 

35751 GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG TAAGTAGTTC 
CG6AQG7AGG TCAGATAATT AACAACGGCC CTTCGATCTC ATTCATCAAG 

35801 GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA GGCATCGTGG 
CGGTCAATTA TCAAAC6C6T TGCAACAACG GTAACGATGT CC6TAGCACC 

35851 TGTCACGCTC GTC6TTTGGT ATGGCTTCAT TCAGCTCCGG TTCCCAACGA 
ACAGTGCGAG CAGCAAACCA TACCGAAGTA AGTCGAGGCC AAGGGTTGCT 

35901 TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG CGGTTAGCTC 
AGTTCCGCTC AATGTACTAG GGGGTACAAC ACGTTTTTTC GCCAATCGAG 

35951 CTTCGGTCCT CCGATCGTTG TCA6AAGTAA GTTGGCCGCA GTGTTATCAC 
GAAGCCAGGA G6CTAGCAAC AGTCTTCATT CAACC6GCGT CACAATAGTG 
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36001 tcatggttatIHRgcactg cataattctc ttactgtcat GCCAT^R^A 
agtaccaata ccgtc6t6ac gtattaagag aatgacagta cggtaggcat 

36051 agatgctttt ct6tgactgg tgagtactca accaagtcat. tctgagaata 
tctacgaaaa gacactgacc actcatgagt tggttca6ta agactcttat 

36101 gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaaca cgggataata 
cacatacgcc gctggctcaa cgagaacggg ccgcagttgt gccctattat 

36151 ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 

GGCGCGGTGT ATCGTCTTGA AATTTTCACG AGTAGTAACC TTTTGCAAGA 

36201 TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGA6AT CCAGTTCGAT 
AGCCCCGCTT TTGAGAGTTC CTAGAATGGC GACAACTCTA GGTCAAGCTA 

36251 GTAACCCACT CGTGCACCCA ACTGATCTTC A6CATCTTTT ACTTTCACCA 
CATTGGGTGA GCAOGTOGGT TGACTAGAAG TCGTAGAAAA TGAAAGTGGT 

36301 - GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC AAAAAAGGGA 
CGCAAAGACC CACTCGTTTT TGTCCTTCCG TTTTACGGCG TTTTTTCCCT 

* 

36351 ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC TTTTTCAATA 
TATTCCCGCT GTGCCTOTAC AACTTATGAG TATGAGAAGG AAAAA6TTAT 

36401 TTATTGAAGC ATTTATCAGG GTTATTGTCT CAtGAGCGGA TACATATTTG 
AATAACTTCG TAAATAGTCC CAATAACAGA GTACTCGCCT ATGTATAAAC 

36451 AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC ATTTCCCCGA 
TTACATAAAT CTTTTTATTT GTTTATCCCC AAGGCGCGTG TAAAGGGGCT 

36501 AAAGTGCCAC CTGACGTCTA AGAAACCATT ATTATCATGA CATTAACCTA 
TTTCACGGTG GACTGCAGAT TCTTTGGTAA TAAXAGTACT GTAATTGGAT 

* 

36551 TAAAAATAGG CGTATCACGA GGCCCTTTCG TCTTCAAGAA TTGGATCCGA 
ATTTTTATCC GCATAGTGCT CCGGGAAAGC AGAAGTTCTT AACCTAGGCT 



Pad 



36601 



ATTCTTAATT TCTTAATTAA (SEQ ID NO: 34) 
TAAGAATTAA AGAATTAATT (SEQ ID NO: 35) 
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AEX*HPLC GeD-Assocrated Viral Particle Concentration 
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atg gat gca atg aag aga ggg etc tgc tgt gtg ctg ctg ctg tgt gga 48 
Met Asp Ala Met Lys Arg Gly Leu Cys Cys Val l^eu Leu Leu Cys Gly . 

15 10 15 

gca gtc ttc gtt teg ccc age gag ate tec att gtg tgg gee tec agg 96 

Ala Val Phe Val Ser Pro Ser Glu lie Ser lie Val Trp Ala Ser Arg 

20 25 .30 

gag ctg gag agg ttt get gtg aac cct ggc ctg ctg gag aec tct gag 144 

Glu Leu Glu Arg Phe Ala Val Asn Pro Gly Leu Leu Glu Thr Ser Glu 

35 40 45 

ggg tgc agg cag ate ctg ggc cag etc cag ccc tec ctg caa aca ggc 192 

Gly Cys Arg Gin He Leu Gly Gin Leu Gin Pro Ser Leu Gin Thr Gly 

50 55 60 

tct gag gag ctg agg tec ctg tac aac aca gtg get ace ctg tac tgt 240 
Ser Glu Glu Leu Arg Ser 1j€u Tyr Asn Thr Val Ala Thr Leu Tyr Cys 

65 70 75 80 

gtg cac cag aag att gat gtg aag gac ace aag gag gee ctg gag aag 288 

Val His Gin Lys He Asp Val Lys Asp Thr Lys Glu Ala Leu Glu Lys 

85 90 95 

att gag gag gag cag aac aag tee aag aag aag gee cag cag get get 336 

He Glu Glu Glu Gin Asn Lys Ser Lys Lys Lys Ala Gin Gin Ala Ala 

100 105 110 

get ggc aca ggc aac tec age cag gtg tec cag aac tac ccc att gtg 384 

Ala Gly Thr Gly Asn Ser Ser Gin Val Ser Gin Asn Tyr Pro He Val 

115 120 125 

cag aac etc cag ggc cag atg gtg cac cag gee ate tec ccc egg ace 432 

Gin Asn Leu Gin Gly.- Gin Met , Val His Gin Ala He Ser Pro Arg Thr 

130 135 .140 

ctg aat gee tgg gtg aag gtg gtg gag gag aag gee ttc tec cct gag 480 

Leu Asn Ala Trp Val Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu 

145 150 155 160 

gtg ate ccc atg ttc tct gee ctg tct gag ggt gee ace ccc cag gac 528 

Val He Pro Met Phe Ser Ala Leu Ser Glu Gly Ala Thr Pro Gin Asp 

165 170 175 

ctg aac acc atg ctg aac aca gtg ggg ggc cat cag get gee atg cag 576 

Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala Ala Met Gin 

180 185 190 

♦ 

atg ctg aag gag aec ate aat gag gag get get gag tgg gac agg ctg 624 

Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Leu 

195 200 205 

cat cct gtg cac get ggc ccc att gee ccc ggc cag atg agg gag ccc 672 
His Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro 

210 215 220 

agg ggc tct gac att get ggc ace ace tec ace etc cag gag cag att 720 

Arg Gly Ser Asp He Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He 

225 230 235 240 

ggc tgg atg acc aac aac ccc ccc ate cct gtg ggg gaa ate tac aag 768 

Gly Trp Met Thr Asn Asn Pro Pro He Pro Val Gly Glu He Tyr Lys 

245 250 255 
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agg tgg ate ate ctg ggc ctg aac aag att gtg agg atg tac tec ccc 
Arg Trp lie He Leu Gly Leu Asn Lys He Val Arg Met Tyr Ser Pro 

260 265 270 

acc tec ate ctg gac ate agg cag ggc ccc aag gag ccc ttc agg gac 
Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp 
275 280 285 



816 



gag gtg aag aac tgg atg aca gag acc ctg ctg gtg cag aat gcc aac 
Glu Val Lys Asn Trp Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn 
305 310 315 320 

ect gac tgc aag acc ate ctg aag gcc ctg ggc cct get gcc acc ctg 
Pro Asp Cys Lys Thr He Leu Lys Ala Leu Gly Pro Ala Ala Thr Leu 

325 330 335 



tgc ttc aac tgt ggc aag gtg ggc cac att gcc aag aac tgt agg gee 
Cys Phe Asn Cys Gly Lys Val Gly His He Ala Lys Asn Cys Arg Ala 
385* 390 395 400 



864 



tat gtg gac agg ttc tac aag acc ctg agg get gag cag gcc tec cag 912 
Tyr Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser Gin 
290 295 300 



960 



1008 



gag gag atg atg aca gcc tgc cag ggg gtg ggg ggc cct ggt cac aag 1056 

Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro Gly His Lys 

340 345 350 

gcc agg gtg ctg get gag gcc atg tec cag gtg acc aac tec gcc acc 1104 

Ala Arg Val Leu Ala Glu Ala Wet Ser Gin Val Thr Asn Ser Ala Thr 

355 360 365 

ate atg atg cag agg ggc aac ttc agg aac cag agg aag aca gtg aag 1152 

He Met Met Gin Arg Gly Asn Phe Arg Asn Gin Arg Lys Thr Val Lys 
370 375 380 



1200 



ccc agg aag aag ggc tgc tgg aag tgt ggc aag gag ggc cac cag atg 1248 
Pro Arg Lys Lys Gly Cys Trp Lys Cys Gly Lys Glu Gly His Gin Met 

405 410 415 



1296 



1344 



aag gac tgc aat gag agg cag gcc aac ttc ctg ggc aaa ate tgg ccc 
Lys Asp Cys Asn Glu Arg Gin Ala Asn Phe Leu Gly Lys He Trp Pro 

420 425 430 

tec cac aag ggc agg cct ggc aac ttc etc cag tec agg cct gag ccc 

Ser His Lys Gly Arg Pro Gly Asn Phe Leu Gin Ser Arg Pro Glu Pro 

435 440 445 

aca gee cct ccc gag gag tec ttc agg ttt ggg gag gag aag acc acc 1392 

Thr Ala Pro Pro Glu Glu Ser Phe Arg Phe Gly Glu Glu Lys Thr Ohr 
450 455 460 

ccc age cag aag cag gag ccc att gac aag gag ctg tac ccc ctg gcc 1440 

Pro Ser Gin Lys Gin Glu Pro He Asp Lys Glu Leu Tyr Pro Leu Ala 
465 470 475 480 

tec ctg agg tec ctg ttt ggc aac gac ccc tec tee cag taa (SID NO: 36) 1482 

Ser Leu Arg Ser Leu Phe Gly Asn Asp Pro Ser Ser Gin * (SID NO: 37) 

465 490 
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ATGGGTGCTA GGGCTTCTGT GCTGTCTGGT 
CTGA6GCCTG GTGGCAAGAA GAAGTACAAG 
CTGGAGAGGT TTGCTGTGAA CCCTGGCCTG 
CTGGGCCAGC TCCAGCCCTC CCTGCAAACA 
ACAGTGGCTA CCCTGTACTG TGTGCACCAG 
CTGGAGAAGA TTGAGGAGGA GCAGAACAAG 
GGCACAGGCA ACTCCAGCCA GGTGTCCCAG 
CAGATGGTGC ACCAGGCCAT CTCCCCCCGG 
GAGAAGGCCT TCTCCCCTGA GGTGATCCCC 
CCCCAGGACC TGAACACCAT GCTGAACACA 
CTGAAGGAGA CCATCAATGA GGAGGCTGCT 
GGCCCCATT6 CCCCCGGCCA GATGAGGGAG 
TCCACCCTCC AGGAGCAGAT TGGCTGGATG 
ATCTACAAGA GGTGGATCAT CCTGGGCCTG 
TCCATCCTGG ACATCAGGCA GGGCCCCAAG 
TACAAGACCC TGAGGGCTGA GCAGGCCTCC 
CTGCTGGTGC AGAATGCCAA CCCTGACTGC 
GCCACCCTGG AGGAGATGAT GACAGCCTGC 
AGGGTGCTGG CTGAGGCCAT GTCCCAGGTG 
GGCAACTTCA™t3GAACCAGAG"*GAAGACAGTG 
ATTGCCAAGA ACTGTAGGGC CCCCAGGAAG 
CACCAGATGA AGGACTGCAA TGAGAGGCAG 
CACAAGGGCA GGCCTGGCAA CTTCCTCCAG 
GAGTCCTTCA GGTTTGGGGA GGAGAAGACC 
AAGGAGCTGT ACCCCCTGGC CTCCCTGAGG 
ATGGCTCCCA TCTCCCCCAT TGAGACTGTG 
CCCAAGGTGA AGCAGTGGCC CCTGACTGAG 
ACTGAGATGG AGAA6GAGGG CAAAATCTCC 
CCTGT6TTTG CCATCAAGAA GAAGGACTCC 
GAGCT6AACA AGAGGACCCA GGACTTCTGG 
GGCCTGAAGA AGAAGAAGTC TGTGACTGTG 
CCCCTGGATG AGGACTTCAG GAAGTACACT 
ACCCCTGGCA TCA6GTACCA GTACAATGTG 
ATCTTCCAGT CCTCCATGAC CAAGATCCTG 
GTGATCTACC AGTACATGGC TGCCCTGTAT 
AGGACCAAGA TTGAGGAGCT GAGGCAGCAC 
AAGAAGCACC A6AAGGAGCC CCCCTTCCTG 
TGGACTGTGC AGCCCATTGT GCTGCCTGAG 
AAGCTGGTGG GCAAGCTGAA CTGGGCCTCC 
CTGTGCAAGC TGCTGAGGGG CACCAAGGCC 
GCTGAGCTGG AGCTGGCTGA GAACAGGGAG 



FIGURE 33A 

GGTGAGCTGG ACAAGTGGGA GAAGATCAGG 
CTAAAGCACA TTGTGTGGGC CTCCAGGGAG 
CTGGAGACCT CTGAGGGGTG CAGGCAGATC 
GGCTCTGAGG AGCTGAGGTC CCTGTACAAC 
AAGATTGATG TGAAGGACAC CAAGGAGGCC 
TCCAAGAAGA AGGCCCAGCA 6GCTGCT6CT 
AACTACCCCA TTGTGCAGAA CCTCCAGGGC 
ACCCTGAATG CCTGGGTGAA GGTGGTGGAG 
ATGTTCTCTG CCCTGTCTGA GGGTGCCACC 
GTGGGGGGCC ATCAGGCTGC CATGCAGATG 
GAGTGGGACA GGCTGCATCC TGTGCACGCT 
CCCAGGGGCT CTGACATTGC TGGCACCACC 
ACCAACAACC CCCCCATCCC TGTGGGGGAA 
AACAAGATTG TGAGGATGTA CTCCCCCACC 
GAGCCCTTCA GGGACTATGT GGACAGGTTC 
CAGGAGGTGA AGAACTGGAT GACAGAGACC 
AAGACCATCC TGAAGGCCCT GGGCCCTGCT 
CAGGGGGTGG GGGGCCCTGG TCACAAGGCC 
ACCAACTCCG CCACCATCAT GATGCAGAGG 
AAGTGCTTCA ACTGTGGCAA GGTGGGCCAG - — - — 
AAGGGCTGCT GGAAGTGTGG CAAG6AGGGC 
GCCAACTTCC TGGGCAAAAT CTGGGCCTCC 
TCCAGGCCTG AGCCCACAGC CCCTCCCGAG 
ACCCCCAGCC AGAAGCAGGA GCCCATTGAC 
TCCCTGTTTG GCAACGACCC CTCCTCCCAG 
CCTGTGAAGC TGAAGCCTGG CATGGATGGC 
GAGAAGATCA AGGCCCTGGT GGAAATCTGC 
AAGATTGGCC CCGAGAACCC CTACAACACC 
ACCAAGTGGA GGAAGCTGGT GGACTTCAGG 
GAGGTGCAGC TGGGCATCCC CCACCCCGCT 
CTGGCTGTGG GGGATGCCTA CTTCTCTGTG 
GCCTTCACCA TCCCCTCCAT CAACAATGAG 
CT6CCCCAGG GCTGGAAGGG CTCCCCTQCC 
GAGCCCTTCA GGAAGCAGAA CCCTGACATT 
GTGGGCTCTG ACCTGGAGAT T6GGCAGCAC 
CT6CTGAGGT GG6GCCTGAC CACCCCT6AC 
TGGATGGGCT AT6AGCTGCA CCCCGACAAG 
AAGGACTCCT GGACTGT6AA TGACATCCAG 
CAAATCTACC CTGGCATCAA GGTGAGGCAG 
CTGACTGAGG TGATCCCCCT GACTGAGGAG 
ATCCT6AAGG AGCCTGTGCA TGGGGTGTAC 
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TATGACCCCT CCAAGGACCT 
TACCAAATCT ACCAGGAGCC 
GGGGCCCACA- CCAATGATGT 
TCCATTGTGA TCTGGGGCAA 
GAGACCTGGT G6ACTGAGTA 
ACCCCCCCCC TGGTGAAGCT 
ACCTTCTATG TGGCTGGGGC 
ACCAACAGGG GCAGGCAGAA 
CTCCAGGCCA TCTACCTGGC 
TCCCAGTATG CCCTGGGCAT 
AACCAGATCA TTGAGCAGCT 
CACT^GGGCA TTGG6GGCAA 
GTGCTGTTCC TGGATGGCAT 
TGGAGGGCTA TGGCCTCTGA 
TCCTGTGACA AGTGCCAGCT 
GGCATCTGGC AGCTGGCCTG 
GTGGCCTCCG GCTACATTGA 
TACTTCCTGC TGAAGCTGGC 
TCCAACTTCA CTGGGGCCAC 
TTTGGCATCC CCTACAACCC 
AAGAAGATCA TTGGGCAGGT 
GCTGTGTTCA TCCACAACTT 
AGGATTGTGG ACATCATTGC 
AAGATCCAGA ACTTCAGGGT 
GCCAAGCTGC TGTGQAAGGG 
GTGGTGCCCA GGAGGAAGGC 
6ACTGTGTGG CCTCCAGGCA 
SEQ ID NO: 38 



FIGURE 33B 

GATTGCTGAG ATCCAGAAGC 
CTTCAAGAAC CTGAAGACTG 
GAAGCAGCTG ACTGAGGCTG 
GACCCCCAA6 TTCAAGCTGC 
CTGGCAGGCC ACCTOSATCC 
GTGGTACCAG. CTGGAGAAGG 
TGCCAACAGG 6A6ACCAAGC 
GGTGGTGACC CT6ACTGACA 
CCTCCAGGAC TCTGGCCTGG 
CATCCAGGCC CAGCCTGATC 
GATCAAGAAG GAGAAGGTGT 
TGAGCAGGT6 6ACAAGCTGG 
TGACAAGGCC CAGGATGAGC 
CTTCAACCTG CCCCCTGTGG 
GAAGGGGGAG GCCATGCATG 
CACCCACCTG GAGGGCAAGG 
GGCTGAGGTG ATCCCTGCTG 
TGGCAGGTGG CCTGTGAAGA 
AGTGAGGGCT GCCTGCTGGT 
CCAGTCCCAG GGGGTGGTGG 
GAGGGACCAG GCTGAGCACC 
CAAGAGGAAG GGGG6CATCG 
CACAGACATC CAGACCAAGG 
GTACTACAGG GACTCCAGGA 
GGAGGGG6CT GTGGTGATCC 
CAAGATCATC AGGGACTATG 
GGATGAGGAC TAA 



AGGGCCAGGG CCAGTGGACC 
GCAAGTATGC CAGGATGAGG 
TGCAGAAGAT CACCACTGAG 
CCATCCAGAA GGAGACCTGG 
CTGA6TGGGA GTTTGTGAAC 
AGCCCATTGT GGGGGCTGAG 
TGGGCAAGGC TGGCTATGTG 
CCACCAACCA GAi^GACTGCC 
AGGTGAACAT TGTGACTGCC 
AGTCTGAGTC TGAGCTGGTG 
ACCTGGCCTG GGTGCCTGCC 
TGTCTGCTGG CATCAGGAAG 
ATGAGAAGTA CCACTCCAAC 
TGGCTAAGGA GATTGTGGCC 
6GCAGGTGGA CTGCTCCCCT 
TGATCCTGGT GGCTGTGCAT 
AGACAGGCCA GGAGACTGCC 
CCATCCACAC TGCCAATGGC 
GGGCTGGCAT CAAGCAGGAG 
CCTCCAT6AA CAAGGA6CTG 
TGAAGACAGC TGTGCAGATG 
GGGGCTACTC C6CT66GGAG 
AGCTCCAGAA GCAGATCACC 
ACCCCCTGTG GAAGGGCCCT 
AGGACAACTC TGACATCAAG 
GCAAGCAGAT GGCTGGGGAT 
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FIGURE 34A 
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Lys 


Ala 


Phe Ser Pro 


Glu 


Val 


He 


Pro 


Met 


Phe 


Ser 


^ ^ 

Ala 


Leu 


Ser 


Glu 


Gly Ala 


Thr 


Pro 


Gin Jisp Leu 


Asn 


Thr 


Met 


Leu 
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Lys 


Tyr 


Thr 


Ala 


Phe Thr He 


Pro 


Ser 


He 


Asn 


Asn 


Glu 


Thr 


Pro 


Gly 


He Arg 


Tyr 


Gin 


Tyr 


Asn Val Leu Pro Gin Gly Trp 
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Asp Lys Trp Thr Val Gin 
Asn Asp lie Gin Lys Leu 
Gly lie Lys Val Arg Gin 
Glu Val lie Pro hen Thr 
lie Leu Lys Glu Pro Val 
Ala Glu lie Gin Lys Gin 
Pro Phe Lys Asn Leu Lys 
Asn Asp Val Lys Gin Leu 
Val He Trp Gly Lys Thr 
Glu Thr Trp Trp mir Glu 
Val Asn Thr Pro Pro Leu 
Val Gly Ala Glu Thr Phe 
Gly Lys Ala Gly Tyr Val 
Asp Thr Thr Asn Gin Lys 
Ser Gly Leu Glu Val Asn 
Gin Ala Gin Pro Asp Gin 
Leu He Lys Lys Glu Lys 
Gly Gly Asn Glu Gin Val 
Phe Leu Asp Gly He Asp 
Trp Arg Ala Met Ala Ser 
Val Ala Ser Cys Asp Lys 
Asp Cys Ser Pro Gly He 
.He Leu Val Ala Val His 
Ala Glu Thr Gly Gin Glu 
Pro Val Lys Thr He His 
Arg Ala Ala Cys Trp Trp 
Pro Gin Ser Gin Gly Val 
Gly Gin Val Arg Asp Gin 
Phe He His Asn Phe Lys 
Arg He Val Asp He He 
He Thr Lys He Gin Asn 
Trp Lys Gly Pro Ala Lys 
Asp Asn Ser Asp He Lys 
Tyr Gly Lys Gin Met Ala 

SEQ ID NO: 39 



FIGURE 34B 
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Lys Asp Ser Trp Thr Val 
Ala Ser Gin He Tyr Pro 
Gly Thr Lys Ala Leu Thr 
Leu Ala Glu Asn Arg Glu 
Pro Ser Lys Asp Leu He 
Tyr Gin He Tyr Gin Glu 
Met Arg Gly Ala His Thr- 
He Thr Thr Glu Ser He 
He Gin Lys Glu Thr Trp 
He Pro Glu Trp Glu Phe 
Leu Glu Lys Glu Pro lie 
Asn Arg Glu Thr Lys Leu 
Lys Val Val Thr Leu Thr 
Tyr Leu Ala Leu Gin Asp 
Tyr Ala Leu Gly He He 
Asn Gin He He Glu Gin 
Pro Ala His Lys Gly He 
Gly He Arg Lys Val Leu 
Glu Lys Tyr His Ser Asn 
Val Val Ala Lys Glu He 
Ala Met His Gly Gin Val 
His Leu Glu Gly Lys Val 
Glu Ala Glu Val He Pro 
Lys Leu Ala Gly Arg Trp 
Phe Thr Gly Ala Thr Val 
Phe Gly He Pro Tyr Asn 
Glu Leu Lys Lys He He 
Ala Val Gin Met Ala Val 
Gly Tyr Ser Ala Gly Glu 
Lys Glu Leu Gin Lys Gin 
Asp Ser Arg Asn Pro Leu 
Gly Ala Val Val He Gin 
Ala Lys He He Arg Asp 
Ser Arg Gin Asp Glu Asp 
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